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A 150-me Personal Radio Signaling System 


By DOREN MITCHELL and K. G. VAN WYNEN 


(Manuscript received February 23, 1961) 


An experimental 150-me personal signaling system has been set up in 
New York City to evaluate over-all technical performance and explore sub- 
scriber reactions to the system. The system includes pocket receivers equipped 
with tuned reeds and central office arrangements adapted for direct customer 
dialing. This paper describes the system in over-all terms, and tells how it 
was engineered. It also compares this system with the 35-me systems and 
discusses traffic and radio transmission problems. 


I. INTRODUCTION 


An experimental trial of a 150-me personal radio signaling system 
known as the Bellboy system was started in New York City in October 
1960. The service given to customers is very similar to that provided 
by 35-me telephone company systems now operating in several smaller 
cities.'”"* The new system, however, has a much greater traffic capacity, 
uses receivers of greater selectivity and sensitivity, and includes direct 
customer dialing. 

Basically, this system extends the telephone bell to the customer’s 
pocket. Each customer carries a pocket radio receiver containing reeds 
tuned to specific voice frequencies. This allows him to be selectively 
signaled. Three transmitters are installed in the lower part of Man- 
hattan to give coverage to practically all locations below 59th Street. 
Central office arrangements are provided that accept calls that are 
dialed in and, in response, cause the specific signal of the customer who 
is being called to be radiated three times on the air. 
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When this signal reaches the customer’s set, it causes the set to emit 
an alerting signal. The customer now knows that his office has received 
an important call and he goes to the nearest telephone and calls back 
to get his message. 

The systems now in service at 35 me give, in general, this same kind 
of service. As will be pointed out later in this paper, however, the limita- 
tions of the 35-me system, which include receivers of lower selectivity 
and sensitivity and slower signaling speed, severely limit its usage in 
big cities. In addition, there is a more fundamental problem arising 
from the limited number of frequency allocations available at 35 me, 
preventing growth of 35-me systems to more than a few hundred cus- 
tomers in many small cities. These were the main reasons for proceeding 
with the trial of the 150-me system. Some comparisons of the environ- 
ments of the two systems are given in Section IIT. 

An interesting general feature of this experimental trial is that some 
of the receivers were provided with replaceable batteries and some with 
rechargeable batteries. The latter type of receiver requires that the bat- 
tery be recharged every night in a desk-top unit provided for that pur- 
pose, but this removes the necessity of supplying a fresh battery to the 
customer every few weeks. 

In the early planning of this project it was apparent that there were 
two major technical problems. One was the need for a highly sensitive 
pocket receiver with a code-responsive device. The other was the need 
for more accurate knowledge of radio transmission loss into buildings. 

A very sensitive, stable, and selective receiver has been developed by 
Bell Telephone Laboratories to operate in the 150-me range. This re- 
ceiver will be described in more detail in a forthcoming paper, but some 
of its characteristics are covered later in this paper. A receiver capable 


ae ° ac , ° 4 
of giving service at 35 me has also been developed by an outside supplier. 


In order to obtain knowledge of radio transmission loss into buildings 
an extensive test program was set up early in this project. These meas- 
urements have been covered in an earlier paper by Rice.” 


II. OVER-ALL SYSTEM PLAN 


Operation of the Bellboy system can be understood by tracing the 
path of a call as it progresses through the various parts of the system 
that are shown in Fig. 1. When there is an urgent need to reach a cus- 
tomer who is out of the office, his secretary dials his number, which 
might be, for example, 225-4654, and listens. She hears a momentary 
ringback, followed by a warbling tone, which signifies that the number 
has been received, checked, and stored in the central office equipment. 
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Fig. 1 Over-all plan of operation. 


She may now hang up, but if instead she listens a few seconds longer 
she will hear a verbal statement that her Bellboy system number will 
be signaled. Within a period ranging from a few seconds to about one 
minute, the signal is put on the air and the customer’s set emits a steady 
beep. The customer now depresses a button to stop the beep, goes to 
the nearest telephone (in this case, a pay station), and calls his office 
to get the message. 


2.1 Central Office Arrangements 


What actually happened right after the secretary dialed was typical 
of many dialed telephone calls. The first three digits of the number, 225, 
correspond to a central office designation and normally would represent 
the one which serves the called party with incoming service. In the 
Bellboy system the counterpart of the terminating central office is ac- 
tually the J-1 terminal which couples the wire plant to the radio portion 
of the system. All originating offices are wired to recognize the code 225 
and set up a proper route through a tandem office that has the ability 
to accept digits in any form. The tandem office then forwards the digits 


as dial pulses, since the present J-1 is arranged to accept only this type 


of signal. Fig. 2 illustrates the functions included in the J-1 terminal. 
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Fig. 2 Central office arrangements. 


When the last four digits, 4654, are spilled into the trunk they are 
received by dial-pulse registers. A number-checking circuit then verifies 
that the number is actually within the number blocks containing the 
valid codes. A connecting circuit next transfers the four digits from the 
register to a reed switch store on a two-out-of-five basis. 

This number, along with those due to additional calls, stays in storage 
until called for by the storage control circuit, which occurs when the radio 
transmitters are available. At this time the translator uses the informa- 
tion in the stores to connect three of the 32 tone generators in the 36A 
oscillator to the transmitter control circuit. There the tones are amplified 
and fed to the three radio transmitters in the trial system. All trans- 
mitters are turned on, and the presence of the modulating tones and 
RF carrier is verified by “sniffer” circuits at each one. A timing circuit 
then maintains the modulation for about 0.5 second, and then inserts 
a pause of 0.5 second before the next signal. Each individual code sig- 
nal is sent three times to guard against dead spots. 

In principle there are 4960 possible codes when three are chosen at a 
time from a universe of 32. In this system the translation from the four- 
digit decimal number assigned to each receiver to the three tones chosen 
from 32 is made more simple and economical by using only 3200 of the 
possible codes. More codes could be provided by introducing a more 
complicated translator or by increasing the number of tones. 

Bellboy system numbers are served on the basis of groups that arrive 
during time intervals which are slightly over one minute long. This 
arrangement permits an economical design, but results in minor delays, 


which can range from one second to somewhat over a minute, with an 
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average value of about one-half minute. A four-digit number spilled into 
the terminal is transmitted immediately if there is no other number in 
storage. That number retains exclusive use of the radio transmitters 
until it has been radiated the third time about 60 seconds later. Mean- 
while, arriving numbers must wait for service. Then they are radiated 
in sequence, each taking about one second to process. After the third 
transmission, each number is cleared from storage and, when the last 
to arrive has been cleared, the next waiting group is served in similar 
fashion. 

Fig. 3 is a time diagram illustrating the foregoing for a moderately 
loaded system. It is assumed that at time ¢ = 0 the control circuit is 
just starting to serve calls 1, 2, and 3 in group A. The third transmission 
will be completed 63 seconds later. During that interval calls 4 through 
7 arrive in storage and wait there for intervals ranging from 62 to 6 
seconds. At 64 seconds after ¢ = 0 call 4 is served. The figure also shows 
the arrival and service of group C. It will be clear that the maximum 
time interval between the arrival of a call in storage and the completion 
of the third transmission is (120 + n) seconds, where n is number of 
calls in the immediately preceding group. 

Each code that is transmitted comprises three audio tones, each being 
applied at a voltage that produces a carrier deviation of 1.3 ke. The 
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Fig. 3 Time diagram illustrating Bellboy system calls (not to exact seale) 
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tone sources are crystal-controlled transistorized square-wave genera- 
tors. They produce odd multiples of 7.5 eps, beginning at 517.5 eps and 
ending at 997.5. These square waves are then filtered to produce sine 
waves. Two frequency groups, a low and a high, are formed by omitting 
the center frequency, 757.5 eps. In the present system each code con- 
sists of two frequencies from one group and one frequency from the 
other. It will be clear that second-order products generated in the over- 
all system fall outside the present range of tones and would always fall 
between tones if the range were extended by adding more odd harmonics 
of 7.5 eps. Although some third-order products can fall on operating 
frequencies, they will be too low in level to introduce errors. 


2.2 Pocket Receiver 


The pocket receiver will be covered in some detail in a forthcoming 
paper, but a brief general description of it will be of interest. It is a 
highly sensitive and selective radio receiver using ten transistors and 
three vibrating reeds. Two views of a receiver are shown in Fig. 4. Reeds 
are available for each of 32 frequencies produced by the 32-tone genera- 
tor, and any three may be inserted into any receiver. 
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Fig. 4 150-me Bellboy pocket receiver. 
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The receiver proper consists of two stages of RF amplification at 150 
mc, an oscillator with diode mixer, and three stages of IF’. Six transistors 
are used for these functions, another is used for the limiter, and two for 
the amplifier to drive the reeds A final transistor provides audio signaling 
tone to a small transducer which produces the beep. An interesting fea- 
ture of the receiver is that it uses only one frequency conversion with the 
IF at about 5.5 ke. This feature has allowed an economical receiver design 
and provides very satisfactory operation for reed signaling. 

A subsequent paper will describe the reeds in more detail. Each is a 
highly stable tuning-fork structure with a bandwidth of one cycle, similar 
to those used in the trial of reed signaling in mobile radio at Richmond, 


ow é.7 mm Ae 8 
Virginia, some years ago." The new miniature reeds, however, occupy 


only about one-seventh the space, have about one-sixth the weight, and 
require one-fifth the power of those used at Richmond. 


2.3 Integrated Operation 


The operation of the terminal described above assumed that the radio 
channel was assigned solely to signaling, as was the case for the trial. 
The terminal, however, has been designed so that it can operate on a 
channel also used for mobile telephone service. In this “integration,” 
use of the radio channel is alternated between the two services, with mo- 
bile telephone service having priority. Bellboy service numbers which 
have been transmitted once may be required to wait more than the nor- 
mal interval if a telephone call originates before the second transmission. 
This can result in intervals of variable duration between first and second 
or second and third transmission. As this period of time increases, the 
customer might become confused as to whether he was receiving a repeti- 
tion or a new call. For this reason, a timing feature is provided in the 
control circuit to clear from storage any calls which have been served 
whenever this interval becomes excessively large. 


III, COMPARISON OF 35-MC AND 150-MC ENVIRONMENT 


It is well known that long-range “skip’’ interference has necessitated 
dividing the North American continent into zones, in each of which only 
one or two 35-me common carrier frequency allocations are made. The 
nature of skip interference is such that, although the frequency alloca- 
tion may be reused at distances up to a few hundred miles, it cannot be 
reused at distances from roughly 1000 to 3000 miles. Fig. 5 indicates this 
well-known zoning plan as it is used for 35-me mobile telephone service. 
In general, this means that a transmitter of reasonably high power, such 
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Fig. 5 — Common carrier frequency zoning plan for 30-44 me mobile telephone 
service. 


as 250 watts, must be assigned only to its own zone frequency. Otherwise, 
occasional skip interference from its radiation will cause serious inter- 
ference to mobile telephone receivers in distant zones. It is rather signifi- 
cant and of interest, however, that such interference, which amounts to 
free-space transmission, is usually no more than about 20 db above a 
usable signal. 

All Bellboy receivers are at least 20 db less sensitive than mobile tele- 
phone receivers because of their small antennas. Thus at 35 me there 
would probably be no interference problem into Bellboy receivers from 
a distant zone. On the other hand, it would not be allowable to transmit 
at 250 watts on a channel assigned to another zone because of interference 
into mobile radio receivers. 

teducing transmitter power 20 db or more does not appear to be a 
practical solution, since even with 250-watt transmitters and high-per- 
formance receivers the range into modern steel-reinforced buildings is 
only about one and one-half miles, and into big-city streets it is 
about five miles. (Section V covers these matters in more detail. ) 

These factors sum up to the conclusion that in the majority cf cases 
it is only practical to give Bellboy service at 35 me on the regular zone 
channel in any one area. This means that if it is desired to give such serv- 
ice in an area already having highway mobile telephone service, it is 
necessary to integrate Bellboy with the mobile service. This can be 
readily done, but it places a severe limitation on the possible number of 
customers and the grade of service. 
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However, there is an unexpected dividend from this restriction to using 
only the zone channel, which is illustrated in Fig. 6. In most zones, one of 
the interior channels is used, so that there will be no high-powered radia- 
tion in that zone on any nearby channels. This allows the use of a rela- 
tively nonselective receiver. A problem may arise, however, if there is 
potential interference from nearby channels assigned to other than com- 
mon carrier services. One example is the use of an edge channel, as shown 
in Fig. 6. 

These factors permit the satisfactory use of a super-regenerative re- 
ceiver to give service in many small to moderate-sized cities throughout 
the country. About 20 systems are now in service. Such a receiver is 
naturally substantially simpler and therefore cheaper than a more selec- 
tive receiver. As a matter of fact, it is apparent from Fig. 6 that in most 
situations there is no incentive to use a more selective receiver, as long 
as the present allocation plan is used in the 35-me frequency range. 

On the other hand, it is also apparent that, in order to provide ade- 


quate service in the areas where there are large numbers of potential 


customers, a system is needed that can operate where there are many 
more channels available. Fig. 7 indicates the situation in the 150-me 
band. Here there are 11 channels assigned to common carrier usage and, 
since there is no long-range interference problem, all these channels are 
available in any area. 

In some highly settled areas it may be necessary to use almost all of 
these channels for mobile telephones, thereby requiring that Bellboy 
service be given on an integrated basis. In many areas, even the thickly 
settled ones, however, it will be possible to set aside at least one clear 
channel for Bellboy. As pointed out in Section IV, one clear channel pro- 
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vides a large customer capacity. In addition, the 150-me system has a 
1-to-1 speed advantage due to the use of simultaneous tones instead of 
the sequential system used in the 35-me receiver. 

These advantages, however, are obtained at a considerable cost. As 
indicated in Fig. 7, the receiver selectivity must be good enough to allow 
adjacent-channel operation. This, of course, also implies a high degree 
of frequency stability. 


IV. TRAFFIC CONSIDERATIONS 


The central office equipment in the J-1 terminal is designed on a flexi- 
ble basis, so only the equipment needed for a particular situation will be 
installed. When fully equipped, the system will accomodate a maximum 


of 3200 four-digit numbers. These fall in the following number groups: 


1000-1999, 
2100-2699, 
3000-3999, 
1100-4699. 


The number of Bellboy customers, NV, that can be served from a single 
radio channel depends on the amount of traffic which they generate. 
Since the holding time, h, of each call is fixed at approximately three 
seconds by design, the calling rate during the busy hour, c, becomes the 
controlling factor. Early experience with the 35-me system indicates only 
0.4 call per customer per day and approximately 0.1 call per hour in the 
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busy hour. However, in the New York trial the corresponding figures 
were about three times larger. The following expression can be used to 
find the RF channel capacity: 


_ 3600 


N 
ch 


E, 
where F is an efficiency factor. Taking FE = 0.8, ¢ = 0.3, andh = 3 
seconds, it is found that 


(3600) (0.8 ) 
(0.3 )(3) 


= 3200. 
It thus appears that for this calling rate the radio channel would be fully 
loaded by the 3200 codes included in the present system. It is possible 
that the calling rate will vary from one city to another, depending on the 
characteristics of the principle users of the system. 

The number of registers and storage units needed is also affected by 
the calling rate. Registers can be installed on a unit basis, while stores 
come in blocks of 10. The present maximum is 10 registers and 40 stores. 

The holding time for an incoming trunk and register circuit depends 
somewhat on the customer. In cases where the call comes through tandem 
and the customer hangs up promptly upon hearing the warble tone signi- 
fying acceptance of the call, the holding time, h, , may be as short as 7 
seconds. In cities where the customer dials directly from a step-by-step 
office and waits for the complete voice announcement the holding time 
may increase to about 12 seconds. Taking the lower figure of 7 seconds 
and assuming c = 0.2 and N = 3200, the load offered the registers in the 
busy hour would be 

ch,N — (0.2)(7)(3200) 
a, = — = —— =: 1.24. 
3600 3600 

This average figure, which is derived by assuming that all calls come 

in at a uniform rate, is defined as the load in erlangs. Standard tables are 


. 9 . . . ° e 
available’ that show the grade of service for a given load when various 


numbers of the given device or facility are available. 

From such a table it is apparent that the provision of five registers 
would insure a good grade of service, in which the probability of being 
delayed due to lack of a trunk and register would be less than 0.01. As- 
sumption of a 12-second holding time would increase the load to 2.1 
erlangs. This would require seven registers for approximately the same 
grade of service. 

In order to develop the holding time for a storage unit, it is necessary to 
recognize that the basic time interval during which new calls arrive in 
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storage has a duration equal to the maximum service time for the im- 
mediately preceding group of calls. This is 60 seconds plus the time 
needed for the third transmission of the n old calls being serviced. Since 
each transmission of a call requires one second, the basic acceptance in- 
terval is (60 + n) seconds, during which n’ new calls arrive in storage. 
Since it is equally likely that new calls may appear at any time in this 
interval, the average value of the acceptance interval is thus (60 + n)/2 
seconds. Each new call has its third transmission completed in 60 seconds, 
and so the average call transmitting time is 
, 
60 + 4 
The average storage holding time is the sum of these two times, i.e., 
gb 4 Sale 
9 9 


; 


or about 90 + 7, where fi is assumed to be the average n. 

It is of interest to develop the load factor for stores but, in order to do 
so, we must determine 7% for various calling rates. The average number 
of stores needed, 7, depends on the calling rate and holding time, as be- 
fore. Substituting the value (3f + 7) for the holding time, where ¢ is the 
interval in seconds between successive transmissions, it is found that 


cN(3t +7) 
3600 


which is the same as the load expressed in erlangs. Solving gives 


3teN 


- erlangs. 
3600 — cA : 


This is not a rigorous solution, but is believed to be a reasonably close 
approximation. 

The following table shows the load offered to the storage system for 
oe 


3200 customers and various values of calling rate ¢ and time between 


signals ¢: 


ee Load offered, a, , in erlangs 
Calling rate 
in busy hour, ¢ 
t = 20 seconds t = 25 seconds = 30 seconds 


0.50 


0.20 
0.10 
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teference to delay curves shows that, with 3200 customers, 40 stores 
will be sufficient if the calling rate is 0.2, and be nearly sufficient for a 
calling rate of 0.3 to assure a delay probability of 0.01. If desirable, the 
load could be reduced by reducing the interval t between transmissions. 
The figure of 30 seconds has been used during the trial, but further ex- 
perience may indicate that this can be reduced somewhat. It should also 
be noted that the present system is more than adequate for a calling 
rate of 0.1, which has been found in systems operating in other cities. A 
more rigorous analysis of this case is beyond the scope of this paper. 

The tone generators are designed on an individual basis, each unit 
being a replaceable printed circuit board. In small installations it is pos- 
sible to substitute a resistance for some of the generators and equip only 
those required for the N codes in the initial system. The number needed 
is approximately 2(N* + 1) generators. A new installation having 300 
customers could start by equipping 16 generators. As growth occurs 
additional units can be inserted in the mounting rack. 

A study of the calling habits of customers in the New York trial was 
carried out by slaving a 20-pen recorder to the relays controlling connec- 
tion of the tone generators. These recordings furnished the data for the 
estimates that were used earlier for the average calling rates during the 


five-day working week. There is some calling on Saturday and Sunday, 


but at a greatly reduced rate. 


V. RADIO TRANSMISSION CONSIDERATIONS 


The key element in transmission performance of the Bellboy system is 
the pocket receiver. The 150-me receiver can trigger on an electrical in- 
put signal of about one microvolt and is thus comparable in electrical 
performance to receivers now being used for mobile telephone. In addi- 
tion, its selectivity is better than 80 db at a frequency 30 ke away from 
the carrier, which is also comparable to mobile telephone requirements. 

Any pocket receiver, however, suffers a very substantial loss in effec- 
tive sensitivity due to the fact that the antenna is small compared to a 
half wavelength. The antenna in this receiver is a ferrite stick, which has 
been found to give about the best performance of any tested so far. Even 
this antenna, however, has a loss of over 20 db compared to a half-wave 
dipole. 

One of the problems in engineering this system was to find a reliable 
and simple method of measuring the small receivers. Two types of meas- 
urement are important. One is to determine sensitivity in free space and 
the other to determine sensitivity when the receiver is carried in the 
normal manner near a person’s body. 
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Early tests were made in an open field at Holmdel, New Jersey, to 
“ ’ . 


determine free-space sensitivity, and these figures were then compared 


to typical sensitivity on the body. Sensitivity is defined as the field in 
decibels above 1 microvolt per meter required to just trigger the receiver. 

Fig. 8 shows the relative response of a typical receiver when worn in 
a man’s outer coat pocket as compared to its sensitivity in free space. 
As might be expected, there is quite a large variation depending on which 
way the receiver or the man with a receiver is facing relative to the trans- 
mitter. It is of interest that in some orientations the man provides gain. 
Apparently, a ferrite stick couples electrically to some extent to the body, 
thus effectively using the body as an antenna. In other orientations, how- 
ever, the body provides shielding. It has been assumed for engineering 
purposes that the average sensitivity of a receiver would be used for esti- 
mating range. This seems reasonable, since it appears equally probable 
that a person may be facing in any direction when he is signaled. It is 
of interest that the average sensitivity of this receiver is 35.8 db on the 
body and 35 db in free space. 

It is apparent that measuring receivers in an open field is a clumsy and 
time-consuming process, so experiments were made with various methods 
of measuring receivers in the laboratory. The most compact arrangement 
was a jig in which the receiver was placed in a plastic structure so that 
it coupled closely to a fixed pick-up coil. Although this method was satis- 
factory for rough checking, it was found that its accuracy left something 
to be desired. For this reason another method was worked out and is 
shown in Fig. 9. This involved the use of a short section of waveguide 
which is big enough to transmit 150 me with very little attenuation. Al- 
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Fig. 9 Method of measuring receivers using 150-me waveguide ‘‘bear trap.”’ 


though this arrangement, which was nicknamed the ‘bear trap,” was 
somewhat bulky, it could be set up in a moderate-sized room and was 
found to give reproducible results. 

As indicated, the receiver was placed in a piece of foam plastic about 
four feet away from the transmitting loop and in the center of the wave- 
guide. The combination of space cloth and metallic termination at the 
right turned out to be quite effective in holding down the standing wave 
ratio, which was about 1.2. It thus turned out that the receiver was in a 
fairly uniform vertically polarized electric field, and the measurements 
were not overly critical as to receiver position. Fig. 10 shows a correla- 
tion of free-space measurements in an open field and measurements in 
the “bear trap” for about 30 receivers. 

Using the figure of 36 db sensitivity, it is possible to predict range using 
the technique described by Rice.” 

The following table summarizes the figures used in this estimate in the 
same manner as those shown in Table IT of the Rice paper: 


Receiver sensitivity above 1 microvolt per meter: 36 db 
Corresponding minimum usable power in a whip 
antenna: —116 dbw 
{adio transmitter power (dipole) : 24 dbw 
Maximum allowable path loss (C — B): 140 db 
Building loss for 78 per cent reliability: 31 db 

- Equivalent maximum allowable median loss in 
streets (D — FE): 109 db 


For convenience, Fig. 11 is reproduced from the Rice paper,’ showing 
the building loss in db referred to median field in the streets. Fig. 12, re- 
produced from the same paper, indicates path loss between a half-wave 
dipole and a whip in the streets. (Most of this material originally ap- 
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Fig. 12 — Measured path loss at 150 me between a half-wave dipole and a 
quarter-wave whip in Manhattan, the Bronx, and suburbs. Antenna heights were 
150 feet for transmitter, 6 feet for receiver. 


peared in a paper by Young.’’) From Fig. 12 it is apparent that 109 db 
median path loss in the streets corresponds to a range of about 1.6 miles. 
Thus, we would estimate a little over a mile and one-half range into 
buildings with this receiver, with a 78 per cent chance of success in one 
try. 

Similarly, by interpolating on Fig. 12 we would estimate a range of 
about four and one-half miles in the streets for the path loss to be less 
than 140 db for 78 per cent of the cases. 

The use of 78 per cent reliability for one try ties in with the planned 
reliability of Bellboy system service of 0.99 or better. Thus, by engineer- 
ing a system so that the probability of receiving a single transmission is 
0.78, the probability that failure will occur three successive times is 


(0.22)° or one per cent. Thus the probability that at least one transmis- 


sion will be successfully received is 1 — (0.2)° or 99 per cent. 

Our experience has indicated that this estimate of one and one-half 
miles range into buildings and four and one-half miles in the streets is 
reasonably correct. Obviously, several transmitters will be needed to 
cover a big city. 

It has been found possible to transmit simultaneously from transmit- 
ters with overlapping coverage provided radio frequency is controlled to 
an accuracy of one part per million, ie., 10°. This holds the operating 
frequency of 152.780 me to +153 eps, and keeps beat notes in the re- 
ceiver below the lowest tone in the coding system. 
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Laboratory tests have shown that simultaneous reception of two sig- 
nals produces a slight gain in receiving provided the phases of the modu- 
lating tone differ by less than 70°. This is easy to assure, and modulation 
of the three transmitters in the trial was well within this limit It has 
not been possible to discover any effect, either positive or negative, in 
field testing. If, however, the phases of the modulation at two transmit- 
ters differ by more than 90 degrees there is a definite penalty. When the 


difference is 130 degrees the receiver will not operate if the two signal 


strengths are within 4 db of each other. At 180 degrees the corresponding 
figure is 6 db. These test results were obtained when the frequency differ- 
ence between the two carriers ranged from 20 to 300 cycles per second. 

Fig. 13 shows the locations of the three 250-watt transmitters in lower 
Manhattan. These give coverage in every street location tested below 
59th Street and in practically all buildings. As might be expected, how- 
ever, there are a few dead spots. These occur particularly in the lower 
floors and the interior portions of large buildings which have considerable 
metal in their construction. One example is the interior of the lobby of 
the Waldorf-Astoria Hotel, while another is part of Grand Central Termi- 
nal 

One customer also found that he could not receive signals while he 
was in the hold of a ship docked in Brooklyn. This does not seem surpris- 
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ing, since a ship interior is probably nearly the equivalent of a shielded 
room. 

In general, it appears that the estimated range figures can be used for 
most commercial situations. It may turn out, however, that additional 
low-power transmitters will be needed to cover certain important build- 
ings such as hospitals or large office buildings. 

Transmitters would have to be located fairly close together in places 
like downtown Manhattan, but could be spread out considerably in resi- 
dential areas. The short range, however, is not entirely a- disadvantage. 
It will allow a channel frequency used exclusively for Bellboy service to 
be reused at much shorter distances than is the case with mobile tele- 
phone. 

A potential interference problem exists, however, from the use of satel- 
lite transmitters. This comes about when a mobile telephone customer 
on an adjacent channel drives his car in the near vicinity of a satellite 
transmitter. In this case, the mobile customer’s receiver may be sub- 
jected to a strong field compared to the field he receives on the desired 
channel from a distant fixed transmitter. It appears, however, that 
trouble can be avoided on this score. 

In the first place, the fixed transmitter used on the adjacent mobile 
channels can be coordinated with the Bellboy system so that it will al- 
ways be turned on if Bellboy transmitters are radiating. This will guaran- 
tee that there is at least some field present near every satellite transmit- 
ter. A further improvement in this situation can be effected by the use 
of directive antennas for the satellites. These may be stacked dipoles or 
some similar arrangements providing directivity in the vertical plane, so 
that excessively strong fields just below the satellite transmitters would 
be avoided. 
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The Effect of Driving Electrode Shape 
on the Electrical Properties of 
Piezoelectric Crystals 


By J. A. LEWIS 


(Manuscript received February 23, 1961) 


In the present paper general formulas for the electrical admittance of a 
piezoelectric crystal, in terms of its resonant frequencies and static and 
motional capacitances, are derived and applied to the investigation of the 
effect of electrode shape on the spectrum of resonances and the capacitance 
ratio of the crystal. Particular attention is given in two cases of practical 
importance, namely, small piezoelectric coupling and thin crystal plates. 


I. INTRODUCTION 


Piezoelectric crystals are often used as circuit elements in filters and 
oscillators. Fig. | shows a typical admittance curve for such a crystal 
and lig. 2 shows the corresponding equivalent circuit. At very low fre- 
quencies the crystal behaves like a capacitor, with a capacitance ap- 
proximately equal to the static capacitance between the driving elec- 
trodes. Due to the piezoelectric effect, an applied alternating electric 
field causes the crystal to vibrate and, at certain natural frequencies of 
free vibration, it is driven into mechanical resonance by the applied 
voltage. 

In the neighborhood of such natural frequencies, the admittance of 
the crystal is closely approximated by the simple equivalent circuit of 
Vig. 3. This is the equivalent circuit commonly used in the applications. 
It is a good approximation over a frequency range proportional to the 
spacing between the resonant frequency, at which the admittance is 
infinite (in the absence of dissipation), and the antiresonant frequency, 
at which the admittance vanishes. 

lor small electromechanical coupling this spacing is proportional to 
the capacitance ratio C,,/Co. It is desirable to make this ratio as large 
as possible. Bechmann and Parsons’ have shown how this may be done 
in various simple cases. 
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Fig. 1 — The equivalent electrical admittance of a piezoelectric crystal. 


At high frequencies, where many resonances may occur within a nar- 
row frequency range, the simple equivalent circuit of Fig. 3 may cease 
to be applicable. However, it is always possible, at least in principle, to 
find an electrode configuration which does not excite one or more of 
these resonances. A simple example is a symmetric electrode configura- 
tion, exciting only symmetric modes of free vibration. Vormer’ has 
shown theoretically and experimentally what the appropriate electrode 
shape is for the longitudinal vibrations of a piezoelectric bar. 

In the following, we consider both the question of capacitance ratio 
maximization and that of resonance suppression in some detail from the 
theoretical point of view. The investigation is divided into three parts. 

In Sections II through VI we consider the general problem of steady 
vibrations of a piezoelectric body. The principal tools used in these sec- 
tions are the piezoelectric analogs of various integral theorems of clas- 


sical elasticity, found, for instance, in Love.” We prove, for example, 


that the material particle displacements corresponding to two different 

















Fig. 2 — The equivalent electrical circuit of a piezoelectric erystal 
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Fig. 3 — The equivalent electrical circuit near resonance. 


modes of free vibration of a piezoelectric body are orthogonal just as 
they are for an ordinary elastic body. 

The principal results of these sections are general expressions for the 
equivalent electrical admittance of any piezoelectric body and for the 
motional capacitances. The latter expression indicates how the electrode 
configuration must be chosen to suppress a given resonance. 

Section VII is devoted to the case of small piezoelectric coupling, in 
which, to a first approximation, we may separate the mechanical and 
electrical problems completely, thus making it possible to obtain more 
explicit results than in the general case. A sample result states that the 
capacitance ratio C,,/Co is equal to the square of that portion of the 
input electrical energy which goes into exciting the nth mode, divided 
by the product of the electrostatic energy of the driving field and the 
strain energy of this mode of free vibration. As incidental by-products of 
our calculations, we obtain upper and lower bounds on the change in 
resonant frequency produced by a change in electrode configuration 
and an upper bound on the capacitance ratio. 

Finally, in Sections VIII and IX, we make use of the simplifications 
possible when the piezoelectric body is a thin plate, in particular obtain- 
ing an explicit relation for the electrode configuration maximizing the 
capacitance ratio. As a simple although somewhat artificial example, 
we consider the low-frequency longitudinal vibrations of a piezoelectric 
bar (the example treated by Vormer’) in some detail, using this example 
to point out some of the advantages, as well as the difficulties, in the 
practical application of the foregoing techniques. 

Before commencing our discussion of these problems, a few general 
remarks may be appropriate. In order to apply the techniques presently 
proposed, considerable detailed information about the modes of free 
vibration is required. Until recently, such information was available 
only for a few special cases. However, at least for crystal plates, it ap- 
pears that such information may be obtained by the approximate theo- 
retical techniques developed by Mindlin and his co-workers* over the 

* The output of Mindlin and his coworkers in this field is so extensive that a 


complete bibliography is impossible here. Refs. 4, 5, and 6 are most closely con 
nected with our work. 
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last decade. Experimental methods using an electric probe which may 
be moved over the surface of the erystal, such as the methods developed 
by Van Dyke’ and by Koga, Fukuyo, and Rhodes,” may also yield the 
desired information. Conversely, measurement of the electrical admit- 
tance of a given crystal for various electrode configurations and driving 
frequencies may yield considerable information about mode shapes. 


Il. THE BASIC EQUATIONS 


The equations governing the steady vibrations of a_ piezoelectric 
crystal at angular frequency w may be written in the form 


Ti;.; + pwu; = 0, 
D:; = 0, 


where 7';; is the stress tensor, p the mass density, u; the material particle 
displacement vector, and D; the electric displacement vector. Here and 
in the following, we use Cartesian tensor notation (see, for example, 
Jeffries), in which commas denote differentiation with respect to the 
Cartesian coordinates (2), 2,3) and a repeated subscript indicates 
summation over all possible values of that subscript. Thus, for example, 
2) is just the usual quasistatic electric field equation, stating that the 
divergence of the electric displacement vector vanishes. The symbols 


. . 10 
used for stress, strain, etc., are those used by Mason. 


In the case of a piezoelectric medium the stress 7';; and electric dis- 
placement D; are given in terms of the strain S,;; and electric field EF; 
by the linear anisotropic constitutive relations 


T 
D g Ee + CitmSkm ; 


where S,; and £; are given in terms of the particle displacement wu; and 
electrical potential V by the relations 


(5) 
(6) 


mr . f . . . ° 
Che elastic constants ¢, jxm , piezoelectric constants e,;; , and dielectric 


constants ¢,; satisfy the symmetry relations 
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We shall also have occasion to use the alternate constitutive relations 
Tis = Citen Stem — ReisDre , (7) 
E; = Bix De — RiemStm ; 
Sijtm Tem + deijsEr , 
én Ex + dine T tem ’ 
where 
fort = J, 
fort ¥ j, 
Rikm 


D 


Ci jkm - , = i hy: je pkm 3» 


5 E E 
s, p*q ( jpmq 


aki; = 


i p@q@ , 
€ij ij td pa® ipaq: 
The magnitude of the piezoelectric effect is specified by the piezoelectric 
coupling coefficient k, where k” may be given by any one of the three 


dimensionless ratios, e-/ce, h’/c8, d°/se, with e, ¢, €, ete., being repre- 


sentative values of the corresponding material constants. For all real 
piezoelectric materials k is small compared with unity. 


HI. BOUNDARY CONDITIONS 


The particle displacement u; and electric potential V, satisfying the 
preceding equations in a piezoelectric body B, are completely determined 
by the specification of certain conditions on its surface S (see Fig. 4). 
We shall always assume that the body is supported in such a fashion 
that its surface is free of tractions, i.e., 


T in; ; (11) 








Fig. 4 Piezoelectric body with driving electrodes. 
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on S, where n; is the outward normal vector. This assumption is not 
essential; we could consider just as well the case of a rigid (or compliant ) 
support. It does, however, make the subsequent algebra somewhat 
simpler. 

The body is assumed to be driven by an alternating voltage of con- 
stant amplitude V,, applied between electrodes So and S; , plated on 
its surface. Thus 

Vi, on S,, 
(12) 
Q, on So. 


The remaining surface is assumed to be free of plating. On this por- 


tion of the surface, then, one has two conditions requiring that the po- 
tential and normal electric displacement be equal to the potential and 
normal electric displacement in the external field. We may take this 
external leakage field into account formally by simply requiring that 
the equations of the preceding section hold throughout all space (except 
on So and S,), with the stress, strain, and piezoelectric constants van- 
ishing identically outside B and the dielectric constant outside B being 
that of free space. In the following we shall indicate that an integral is 
to be taken over all space by using B, in place of B. To complete the 
set of boundary conditions, we assume that V vanishes at large distance 
from B. The external field is included here only for formal logical com- 
pleteness; in almost all practical problems it is of negligible importance. 


IV. INTEGRAL RELATIONS FOR PIEZOELECTRIC BODIES 


Suppose that u; and V are any (suitably continuous) vector and 
scalar functions, defined on B + S. Note that, by our previous con- 
vention, V is actually defined throughout all space. We shall assume 
that V is continuous across S and zero at infinity. In general, of course, 
u, and V will not satisfy the equations of the preceding section. However, 
by introducing suitable volume and surface forces, FP; , T; , and volume 
and surface charges, gs , gs , We may construct a boundary value problem 
satisfied by these functions. 

First we calculate strains and electric field components, S;;, 2; , 
from (5) and (6) and stresses and electric displacement components 
from the constitutive relations, (3) and (4). The required distributions 
of volume force and charge are then given by the relations 


oF (T + pw u 
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where w is an arbitrary positive constant. Similarly, the distributions of 
surface force and charge on S are 

T: = Tim;, 
qs [Dijn; , 


where n; is the outward normal to S and [Dj] is the jump in the electric 
displacement. vector across S, given by 


[Di] = (Didext — (Di). 


Now suppose we have any two pairs of such functions (u,", V’), 


(u;”,V”) and calculate the corresponding stresses, electric displace- 
ments, etc. Then the divergence theorem yields 


Jon ” toy 7 Y I ony ” 
[ u;T,;; dB = [ w. Tn; dS — / S,;; 7T;; dB, 
“2 bal B 


J 


[ V'D,.” dB = — | VID." ndS + [ B/D,” aB. 
B 


on. vs 


or, in terms of the equivalent forces and charges, 
, wv yo ow Fun . . dua & - 
| pu: (PF; +. u; )dB —_ u;T; dS + | Si Ti; dB (15) 


“B “Ss al 


a a . 


yf ” x? ” ’ > / ” » 

| Vqe dB _ Vgqs dS + | E;D; dB. (16) 
” By “Ss "Ba 

lor example, for the forced vibrations previously considered, with 

(ui’,V’) = (ui”,V") = (ui, V), 0” = wo, Fi” = Ti” = gq,” = 0, 


tion and rearrangement of (15) and (16) yields 


addi- 


| (Cijkm SijSkm + €ix Lik — pwun:)dB=ViQi, — (17) 
a 


x 


where V; is constant and the total charge on S; is 


i = [ as as [Di|n; dS. (18) 


Note that, by our previous convention, ¢; jem” and u; vanish outside B. 
Mquation (17) essentially states that the sum of the potential energy, 
made up of the strain energy and electrostatic energy, and the kinetic 
energy is equal to the energy passing into B through S; due to the ap- 
plied voltage V, . 


For free vibrations, with (u;, V,) (ui",V",on), (17) yields the 
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Rayleigh quotient for the natural frequency w, , 1.¢., 


"EB. ) dB 


pu; u; dB 


This relation suggests that the effect of piezoelectricity is to increase the 

natural frequencies over the values they would have in an ordinary 

elastic body with the elastic constants ¢; jm. In Section VII we shall 

prove that this is the case for small piezoelectric coupling coefficient /. 
If we subtract (16) from (15) and rearrange terms, we obtain 


- . 


[pu a F ” ‘ Pa 2”) _ Van’ | dB ms | (u.T; 


“BRB Ss 


x 


” 


- Vas" dS 


(f.0." — 8,0; ae. 


“B 


x 


This equation still holds when we interchange primed and double-primed 
quantities. Furthermore 


S;"T y = §,;/T;;" — ED’, 
and thus 


| [ow (F us’) — Vqn |dB + | (ui TT” — V'qs") dS 


“B “Ss 


(20) 


ai | [pu,” ( # l J ae Van | dB ds | (a “7! in Vas. dS. 


*e “Ss 


This is the analytical form of the so-called reciprocal theorem (see Ref. 
3, Chapter VII, p. 174) for a piezoelectric body. In words it may be 
stated as follows: 

The difference between the mechanical and electrical work done by the 
forces (including kinetic reactions) and charges of the first set, acting over 
the displacements and potential produced by the second set, is equal to the 
difference between the mechanical and electrical work done by the forces 
and charges of the second set, acting over the displacements and potential 
produced by the first. 


V. EXPANSION IN: NORMAL MODES 


We now return to the problem of forced vibrations, described in Sec- 


tions I and II. The equations and boundary conditions of those sections 





ELECTRICAL PROPERTIES OF PIEZOELECTRIC CRYSTALS 1267 


completely determine a unique particle displacement u; and potential 
V’, except at certain natural frequencies w,(n = 1, 2, ---) where a non- 
trivial solution (u;",V") of the free vibration problem (with V, = 0) 
exists. If we denote the solution of the static problem (with w = 0) by 
(u;,V"), we may satisfy all of the preceding equations and boundary 
conditions, except (1), by setting 


x 
s nm 
U=Uu + D> anu; ; 
n=l 


os 
V = V+ 2 a,V", 
n=l 
where the a,’s are to be determined to satisfy (1). Since 
73.5 = 753.3" + pwr us” = 0, 


in this case (1) is satisfied if 


’ 


x 
p 5 (w, — w )a,U; = pwli. (23) 
n=1 


r; 


Now suppose we set (wi’,V’) = (u:",V™"), (i7,V”) = (14:",V" 
Wm F~ w,, in (20). We obtain 


), with 


“ 


ptm — w, ) | u;"u;' dB = 0. (24) 
B 


so that displacements corresponding to two different natural frequencies 

are orthogonal, just as they are in ordinary elasticity. Thus, multiplying 
. k ° ° . 

(23) by u; and integrating over B, we obtain 


“ e 


2 2\ k k 2 x k 
pla, — w )day, | u;u; dB = pw | u;u; dB. 
at a 
Again applying (20), this time with (u,’,V’) 
(u;,V"), we obtain 


pur’ | us dB | VIDS |In; dS 
B 


es 


iw Vi | [Ds‘|n; dS 


pw ( ax? — w?) 


lor future reference, we also note the identities 
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2 "ry 
Pe / uu” dB [ S,;"°T 33" dB, 
B it 


[ vitp."n. as | ex SyBs” — By"B,*) aB, 


“S) /B 


obtained by direct application of the divergence theorem. 
VI. THE EQUIVALENT ELECTRICAL ADMITTANCE 


Using the apparatus Jeveloped in the previous sections, it is a simple 
matter to obtain a general expression for the admittance of the crystal 
body. The admittance Y(w) is the ratio of the total input current /, 
into the crystal to the voltage V; across its terminals. The current is the 
rate of increase of the total charge Q, on the electrode S; , where 


Q) [ [Di\n; as, 
Js, 


so that 


Y(w) = I, V; = Jol); V; = JoV, ; [ [Di|n; ds. 


“S) 


Substituting 


[D,| [D.] + 3 a,,{D;"}, 


we obtain 


6 » 

; ~ oC, 

Y (w) iw (c +> x .} 
n=1 W@W," — @ 


where the static capacitance (C, is given by 


Se | [D,"\n; dS. 


“S) 


and the motional capacitances C,, by 


/ [D;"\n, is) 
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(28) may also be written in the form 


Y(w) = jw (« + , 7. ) . 
n=l F 


lig. 2 shows the corresponding equivalent circuit with 
Dis = l ‘ite Ue ’ r= iF 2, ‘nas 


and Fig. 1 shows the corresponding behavior of the admittance with 
frequency. Since C,, «< C, for real piezoelectric materials, the admittance 
is very nearly equal to jw, , except in the vicinity of a resonant fre- 
quency, where the admittance is that of a simple series-resonant circuit, 
shunted by the capacitance Cy. The corresponding antiresonant fre- 
quency w,' is given approximately by 

wo 14+, 


Wn” Co 


; 


or, since C,,/Cy < 1, 


Aw, Wn — Wn _ Ch (39) 
Wn Wn 2Co 

All of these considerations hold when the resonant frequencies are not 
too closely spaced, specifically when (@,41 — wn)/w, is large compared 
with (,,/Co. At high frequencies, where many resonances may occur in 
a narrow frequency band, the region of applicability of the above simple 
single-resonance circuit may be very small, and the simple relation for 
the antiresonant frequency w,’ given by (32) may not be an adequate 
approximation. In order to circumvent this difficulty, we may choose 
the shape of the electrodes S; and Sp so that resonances in the vicinity 
of a desired resonance in this frequency range are not excited, i.e., so 
that the corresponding C,,’s vanish, or 


/ [D;"|n;dS = 0. 


This is a necessary and sufficient condition that the nth resonance not 
be excited by the given plating shape. 

In order to make use of this condition, the surface charge [D;"|n; must 
be determined, either theoretically, perhaps by using an approximate 
theory of the type applied so successfully by Mindlin and his cowork- 
ers,*”’° or experimentally using a point electrical probe as developed by 


Van Dyke’ or Koga, Fukuyo, and Rhodes.’ In the case of high-frequency 


vibrations, one would expect that the resulting electrode shape would 
be quite complex. Clearly the choice of electrode shape suppressing a 
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finite number of resonances is not unique. For these and other even more 
pressing reasons, it is not clear whether the use of the condition is of any 
particular practical importance. 

Besides affecting the value of C,,, the choice of electrode shape also 
affects the magnitude of the static capacitance C, (or Co). Since Co 
shunts the series-resonant elements, it is desirable to make its value as 
small as possible, i.e., to make the capacitance ratio C,,/Co as large as 
possible at the desired resonance. Finally, because of piezoelectric cou- 
pling, the values of the resonant frequencies themselves depend on the 
electrode shape, although only to second order in the piezoelectric cou- 
pling coefficient ’. In order to obtain more concrete information con- 
cerning these effects, in the next section we consider the case of small 
piezoelectric coupling. 


VII. SMALL PIEZOELECTRIC COUPLING 


In this section we shall obtain expressions for the static capacitance, 
the capacitance ratios, and the shift in natural frequencies due to a 
change in electrode shape, valid for small piezoelectric coupling coeffi- 
cient k. With an eye to the applications, these expressions should be the 
most convenient forms for application to the most common practical 
case, namely, a crystal plate. 

First of all, it is clear that, to first order in k, the static capacitance 
C’, is simply the ordinary electrostatic capacitance in the absence of any 
piezoelectric effect. Thus, to first order, the static potential V" satisfies 
the equations 


D; 


in B, and 
V; ‘ on Si " 
Q, on So . 


(In this instance, one should recall our previously agreed upon conven- 
tion concerning the treatment of the external region.) The static capac- 


itance is then given by 


[ €,; EE; db. 34 ) 


“B 
x 


Next we consider the solution of the free vibration problem for small 
coupling. To first order in k the particle displacement u;" and natural 
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frequency w, are those of the purely elastic problem, and the potential 
V" and electric displacement D,;" are of first order in k. Finally, the 
change in natural frequency is of second order in k. Thus we set 


w = wo(l + v”), 
where the quantities with index “0” are of zero order in k, those with 
single primes first order, and those with double primes second order in 
k. We have also temporarily dropped the index ‘‘n”’ in order to reduce 
the number of indices present. The governing equations and boundary 
conditions then become 
mn 0 2 0 . 
T ij.5 + pwo ti : in B, 
ons, 


in B, , 
on S, 
in B, 
on S, 


where the 7';;’s and D,’s remain to be defined. 

The definitions of these quantities, and thus u,", wo, V’, u:”, and v” 
depend on the choice of constitutive relations used. For example, if we 
use (3) and (4) for the stresses and electric displacements in terms of 
the strains and electric fields, we have 

Srv ’ 0 
Di = ei Ee! + CiemStm ; 
ijem™ Sem” — Orig Ee’, 


whereas if we use (7) and (8) we have 


Use of either of these sets of relations should give the same results (to 


the order considered) for the potential WV’, the electrie displacement 
D,’, and the natural frequency wy (1 + v”). We may apply the 
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reciprocal theorem, (20), to obtain a simple expression for v” in either 
case. We set (ui’,V’) = (uu; 0), (ui”,V”) = (1:",V’) to obtain the 
expressions 


e - 


ci*E'E; dB/pos’ | usu? dB, 
“3 


- 


‘DD; dB/poo? | uw. ue dB, 
‘BR. “RB 
where we and wp are the natural frequencies found using the elastic 
constants Ci jim) and Ci jem”, respectively, and vg and yp are the corre- 
sponding frequency shifts due to the piezoelectric effect. Since, to second 
order, 


and veg = 0, < 0, we must have 


Q= 
< Wp, od) 


with equality only if ve or vp vanish. According to (35) and (36), this 
can only happen if £,’ or D;’ vanishes identically, 1.e., if the divergence 
of the vector @igm Sim Vanishes in the former case, or if the curl of the 
vector hic» Sim Vanishes in the latter case. Thus, in general, neither of 
these bounds will be attained. We can, however, prove that the lower 
bound wg is most nearly attained by a completely plated crystal and 
the upper bound wp by a completely unplated crystal. Furthermore, 
making the electrodes larger always decreases the resonant frequencies. 
These results, which are not of central importance in our present con- 
siderations, may be obtained by the application of Schwarz’s inequality 
and Green’s identity to (35). 

We have now obtained expressions for the static capacitance and the 
shift in resonant frequency caused by the piezoelectric effect. We now 
derive a simple and symmetric expression for the capacitance ratio 
CC, , using (26) and (27). We have 


— Si; ky, )dB 


neglecting terms of higher order in /:. Also, from (26 


f j ‘ nryr y 
pw ! u, u; dB S;; 7 db, 


"2 "3 


which, in the present case, is either the strain energy in terms of €; jn’ 


kn 


. D ‘ : : : ‘ 
or in terms of Cijem , depending on our choice of constitutive relations. 
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To first order in k the two expressions are equal, of course. Finally, using 


(| we iB) 

i. 

(| 6 EVE, iB) (| Cijem Sis" St iB) 
oe tall 


Direct calculation, again using Schwarz’s inequality, leads to the upper 


, 


(34), we obtain 


bound 


Again we will not dwell on this incidental result, only remarking that it 
provides a general upper bound on a quantity which we usually wish to 


make as large as possible. An alternate form of (38), in terms of stresses, 


(/ Aik, Ti;" iB) 
“B 


Is 


VIII. CRYSTAL PLATES 

In this section we consider a thin piezoelectric crystal plate, driven by 
symmetrically placed electrodes on its face (co = 0, v2 = h), as sketched 
schematically in Fig. 5. In this case, if the thickness A is small compared 




















The erystal plate 
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with the diameter of the electrodes, the static capacitance is given ap- 
proximately by 

C, = €2 A/h, (41) 


where A is the electrode area. To the same approximation, the static 
field is given by 


for (x; , 23) in A, 
for (.t) ,.v3) outside A. 


(We now neglect the external field completely.) Thus we have 


ll e:;;E. S;;" dB — V, [[ S "(a1 , 23) dt, dzs, 


- 
where 
115 t 


or, in terms of average stresses, 


[ a, E.T;; dB 


“R 


where 


With 


dB 


where 
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in terms of strains, and 


de, iT; " 


"2. 


in terms of stresses. Note that the integrals in the denominators of these 


Dp: 


two expressions for p, are simply normalization factors. We retain them 
only to keep the dimensions straight. 

Mquation (42) gives the capacitance ratio in a form which is particu- 
larly easy to study for given p, . We wish to know either how to choose 
A to make (,,/C, zero, in order to suppress a given resonance, or how 
to choose A to make (,,/C, as large as possible, to increase the useful 
bandwidth of the crystal. 

Clearly the former problem has many, many solutions. If, for example, 
p, is both positive and negative over the plate area, as will be the case 
unless we are dealing with some sort of fundamental mode, we need 
only distribute the electrode area A so that the integrals over the posi- 
tive and negative portions are equal. The apparent difficulty when p, 
is everywhere positive (or negative) can be circumvented by a simple 
artifice. We obtained (42) by assuming at the outset that voltages of 
the same polarity were applied between all portions of the top and bot- 
tom electrodes. If instead we imagine A to be divided into two parts, 
A* and A, with voltages of opposite polarities applied to these parts, 
then 


af a” 


| Pr day divs = [/ Pr day divs “me I Pn day dx» 9 


. 
which clearly can be made to vanish by choosing A” and 4A so that 
the corresponding integrals are equal. 

The problem of maximizing the capacitance ratio is mostly visua- 
lized simply as a geometric problem. First of all, note that in this case we 
may replace p, by its absolute value in the integral in (42), for in re- 
gions where p, is negative we may assume that the polarity of the driv- 
ing voltage has been reversed. This obviously will always increase the 
capacitance ratio over the value it has with unreversed polarity in such 
regions. Now we imagine the surface 2. = | pprlai, 23) | to be erected 
above the plane 2. = 0. The integral over | p, | can then be interpreted 
as the volume between this surface and the base plane, cut out by a 
cylinder of cross section A. Denoting this volume by V, we then must 


choose A to make V"/A as large as possible. This problem has a very 
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simple solution, although one which is not particularly well adapted to 
calculation. 

First of all, we observe that, for fived area, to maximize the volume 
we should choose A to be the region bounded by the appropriate level 
curve of | p, |. Any other choice of A, giving the same area, must con- 
sist of this region less a subregion plus an external region having the 
same area. But the value of | p, | for points outside the level curve is 
smaller than for points inside, so that the integral can only be decreased 
by this alternate choice. 

Let us denote the area bounded by the level curve Dn( X1 , X3) 

p = a constant by A, and the corresponding volume by VV. We must 
now choose A, so that V A,1s maximized, i.e., so that 
GUY ate 
dA, 
Now the change in V, due to an increment AA, in A, is pAA,, since 
the height of the surface above the base plane is constant around its 
boundary, and thus 


d(V,?/.  (2pA, 
dA, A? 


which vanishes when pA, }V,, Le., when the volumes above and 


below a plane through the level curve are equal. Explicitly, to maximize 
the capacity ratio we have the condition 


pA, - I/ Pn (01, x3) | dry dirs , 


{ 


where A, is the total area of the electrode on either plate face, bounded 
by the level curve | p,(a@1,2 p, and p, is given by (43) or (44). 
We also assume that the electrode polarity is positive when p, is posi- 
tive and negative when p, is negative. Since A, must of course be less 
than or equal to the plate area, the maximum capacitance ratio given 
by (45) may not actually be attained. If this is the case, a plate wholly 
covered with electrodes gives the largest possible value to the capacitance 
ratio. It may also happen that the maximizing area A, may be partly 
bounded by a level curve and partly bounded by the plate edge. 

Bechmann and Parsons’ have considered the excitation of piezoelec- 
tric bars and plates with partially applied electrodes and have obtained 
results similar to the above for the capacitance ratio, both theoretically 
and experimentally 
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IX. THE LONGITUDINAL VIBRATIONS OF A PIEZOELECTRIC BAR 


To illustrate the foregoing results, we now consider a concrete example. 


To keep the analysis simple, we treat the example discussed by Vormer® 
of the low-frequency longitudinal vibrations of the piezoelectrie bar 
sketched in lig. 6. We assume that h <a < L and restrict our discus- 
sion to vibrations of wavelength large compared with a. Then we may 
assume that all the stresses 7';; are zero except Ty 
a function of x; only. Thus, for free vibrations, 


, which we take to be 
Ti) Ty" vy), 
satisfying the equation of motion 
Ty 1 
and the stress-strain relation 
Si" Uy * Sun Tn". 
Kliminating uv," between these two equations leads to the equation 
Tua + (w, c) Ty" 0, 


, aaa ee 
where ¢ 1 /psiun . Together with the boundary conditions 


this equation vields 


Th" (ty) sin nary / L, 


If we assume that the width of the electrode A at x wi/L is af(x), 


as in big. 6, we find that the capacitance ratio is given by 


: 2k (4) | | flv) sin nar rm 


al 





Che erystal bar 
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where / is the piezoelectric coupling coefficient, here given by 
(doy) 
Si’ €22° 
al, is the bar-face area, and A is the electrode area, given by 
“1 


Ao | f(x) | dx. 


ad 


We must assume that 0 S_ f(x) | S 1, and we may take into account 


portions of electrode of reversed polarity by assuming that f(.) may 


be both positive and negative. 
lor the wholly covered bar (f = 1), for example, (46) gives 


9 


thon’, for? = 1,3, ><. 
Q, forn = 2,4, :--- 


(Clearly an even excitation, such as the above, cannot excite the odd 
modes.) This already provides an example of resonance suppression of 
a very trivial nature. In the present case, within the limitations of the 
theory, we can actually find an electrode shape which suppresses all 


resonances except one, for the 7),"’s are orthogonal. Thus, if we set 
flax sin mre, 
we find 


for n m, 
otherwise. 


Of course, since the application of the present simple theory is restricted 
to low frequencies, we cannot expect the above to be valid for large n 
and m. On the other hand, the low-frequency resonances are sufficiently 
widely spaced so that resonance suppression is of no particular practical 
importance in this case. Thus the present example must be taken as 
illustrative, rather than practical. 

In the present case of one-dimensional vibrations, the level lines of 
p, , considered in the previous section, are straight lines .; a con- 
stant. Thus the electrode shape maximizing the capacitance ratio con- 
sists simply of a sequence of rectangular bands of suitable widths and 
polarities, extending across the width of the bar. To determine the opti- 
mum width, it suffices to consider the fundamental mode (n 1). For 
convenience we shift the origin of coordinates to the center of the plate 

so that 
rT! 


l 


vr} 
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and assume that the electrode band covers the interval |x! S< 2). We 
must choose 2x» according to (45), 


Zo 
2a | cos wz dx = 2a( 2x cos rx), 
0 


tan rity = 212%, 


which is satisfied by x» = 0.371, which is the value found also by Bech- 
mann and Parsons’ in this case, i.e., 74.2 per cent of the bar length cov- 
ered by electrodes. This maximizes the capacitance ratio C)/C, for the 
fundamental mode. The capacitance ratio for overtone modes may be 
maximized similarly, since the overtone modes may be imagined to be 
made up of a set of fundamental modes for n bars, 1/n in length, set 
end to end. 

or the fundamental mode, this maximum capacitance ratio is 0.909k" 
compared with the values 0.811 and 0.786k° for a completely covered 
bar and for a bar with sinusoidal plating which suppresses all overtones. 
Thus we may increase the capacitance ratio about 10 per cent over its 
value for the wholly covered bar by the above technique. 

A difficulty which we do not encounter with the fundamental mode 
shows up when we consider the excitation of the first overtone (nm = 2) 


















































Fig. 7 Various electrode configurations for excitation of first overtone in 
crystal bar 
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In order to excite this mode, we must use electrodes having odd polarity 
about the middle of the plate, as shown in Fig. 7. For example, we may 
use complete (split) electrodes, as in Fig. 7(a), sinusoidal electrodes, as 
in Fig. 7(b), or band electrodes, as in Fig. 7(¢). Especially in the first 
case, there will be a substantial contribution to the total static capaci- 
tance from the capacitance between adjoining electrodes of opposite 
polarity. This contribution has been completely neglected in our pre- 
vious calculations. To make it small compared with the ordinary plate 


capacitance, proportional to electrode area, the spacing between adjoin- 


ing electrodes of opposite polarity should be large compared with the 
plate thickness. This consideration makes the band electrodes [Fig. 7(¢ ) |, 
which maximize the capacitance ratio, particularly attractive. 

The presence of a substantial electrical field component, parallel to 
the bar faces, also changes the form of the function p, and thus the 
motional capacitance. At high frequencies, with a complicated resonance 
spectrum, this field component may excite unwanted resonances. In the 
present case of low-frequency longitudinal vibrations, this effect is 
probably not important. 
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Asymptotic Behavior of General 


Queues with One Server 
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The asymptotic behavior of the virtual delay in a single-server queue with 
order of arrival service is studied, under no restrictions on the stochastic 
nature of the arrival process and the service times, except for a weak sta- 
tionarity condition. This behavior is shown to be governed by a functional 
equation closely analogous to the “fundamental equation” of branching 
processes, already used in special queueing models. A generalization of the 
Pollaczek-Khinchin de lay formula is de rived for the CASE in which de lays 
do not build up. 


I. INTRODUCTION 


There is a queue in front of one server; waiting customers are served 
in order cf arrival, and no defections from the queue occur. The delays 
incurred in such a system can be described by a stochastic process 
|W (t),t 2 Of, the virtual waiting time, defined as the time a customer 
would have to wait for service if he arrived at time ¢. Stochastic processes 
of this kind have been studied.'” 

In the present work we examine the asymptotic behavior of the prob- 
abilities Pr} W (4) J wihast becomes large, for a class of processes satisfy- 
ing a weak condition of stationarity. 

It is customary to define the process W(-) in terms of the arrival 
epoch ¢, and the service time S; of the Ath arriving customer, for 
k 1, 2, «++. We can, however, deseribe the service times and the 
arrival epochs simultaneously by a single function A(-), defined for 
{ = O, left-continuous, nondecreasing, and constant between successive 
jumps. The locations of the jumps are the epochs of arrivals, and the 
magnitudes are the service times. With K(0) WO) for conveni- 
ence, A(t) can be interpreted as the ‘work load” submitted to the 
server during [0,0), and W(t) as the work remaining to be done at ¢. 


1281 
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Formally, W(-) is defined in terms of K(u) forO S u < t by the equa- 
tion 


at 
W(t) = K(t) —t+ | U(—Wiua)) du, 
where U(-) is the unit step function, ie., U(x) = 1 for x 2 0 


U(x 0 otherwise. Equation (1) may be interpreted as follows: 


work remaining at ¢ = load offered up to ¢ 
— elapsed time 
+ total time that server was idle 
in (Q,¢). 
The relationship of W(-) and A(-) has previously been described 
. 1.2 , ; . 
and illustrated.” A principal result of the latter paper (and the only 
one needed for reading the present one) was a formula for the distribu- 
tion of W(t) in terms of the functions 
Pri K(t) S ul, 
Ri tusw Pri k(t K(u) —t+ <w)| Wu) QO}. 
The reference to W(-) in the condition is not circular because, as shown, 


the events 


{Wiu) = 0},”7 ; Sup lK(u) — Kiy) —~ wt y| <= O} 


differ by at most a set of measure zero. The formula for distr} W(t)} is 

PriW(t) < w! PriK(t) —t< wt 
0 ia r 

— R(tujw) Pr} Ww) 


Ow vo 


or, in an integrated form, 


| Pr{/A(t) S ut du Pr{/K(t) < u} du 


a | Ritujw) Pr} Wa) O} du. 


For —t S w S O the chance Pr} W(x) 0} that the server be idle at 
time wu satisfies the Volterra equation 


| Ritujw) Pri Wa) O! du Pr{K(t) < u} du, 


EH \max|0,w — K(t) + 
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These results can be cast into a form more useful for the present en- 
deavor by use of Laplace-Stieltjes transforms. Theorem 5 of the author’s 
paper implies that for Re(s) > 0 


, swift) 
‘ y i 
Va {= 
(6) 


~|W(u) = 0} Pri W(u) O! du. 


This formula for the Laplace-Stieltjes transform of the distribution of 
W(t) can be obtained directly from (4) and (5) by taking the Laplace- 
Stieltjes transform with respect to w. 


Il. BASIC ASSUMPTIONS 


Unless the input process A(-) has some asymptotic temporal uni- 
formity, one cannot expect W(t) to have a limiting distribution as 
t —» x. We shall assume that the increments of AK(-) are weakly sta- 
tionary in the sense* that the basic kernel R(t,u,w) of (3) depends only 


on (t — u) and w, Le., 
Prik(t) — K(u) ~—t+usw)| W(a) ()! R(t — uw). 


The word “tweak” refers to the dependence of the kernel only on the 
difference of its two temporal arguments. 

The stationarity condition (7) turns (5) into an equation of con- 
volution type. Our strategy will be to work with (5), find conditions 
under which Pr} W(aw) = O} converges as u — «, and then use Abelian 
arguments on (4) and (6) to obtain the asymptotic behavior of 


PriW(t) Ss w} ast— , 


. . . . 3.4 
A similar approach has been used in two prior papers by the author. 


We shall only consider the case in which 
WO) Q, (8) 
with probability one. This ‘“‘start empty” condition is in a sense un- 


avoidable in practice, and it leads to simple, elegant results. Equations 
(7) and (8) imply that for éy 2 0 


WO) 0} 
We CR) 0} 


| e "R(tdw). 


“oa 
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The condition | W(- ) 0} will therefore be omitted from now on in all 
probabilities and expectations. 

The following functions oceur frequently in the discussion and merit 
abbreviations: 


E\} max |0,t 
Pri K(t) < R(t,0) R(t), 
Pri W(t 0} Pit). 


In this notation, the Volterra equation (5) for w = 0 can be restated as 
| Rit — w)P(w) du. (10) 


The Laplace-Stieltjes transform of a function is denoted by the cor- 
responding lower-case Greek letter; e.g., 


a 


g(s) | e 'dF(t), Re(s) > 0. 


The ordinary Laplace transform is denoted by the corresponding capital 
Greek letter; e.g., 


| e P(t) dt. 


From (1) defining W(-) it is evident that the mean delay is 


E\W(t)} = EYK(t)} —t + | Pri{W(u) O! du. 


Also, we have 


BE} K(t)} > uj du 


| ll — Pr{K(t) < ul) du + | Pri{kK(t) > u' du 


be 


ex 


t— F(t) + | Pr{K(t) > u} du, 


E}max (0, K(t) — <u} du. 


In addition to the stationarity condition (7) we use the following 
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assugnptions (since some theorems do not depend on all the assumptions 
to be listed, the relevant hypotheses are repeated whena result is stated ) : 
- ; lev up 
i. limsupt E\K(t)} < «; 


t+»% 


ii. lim inf ¢ ‘ log PriK(t) = 0} > —«; 
t+» 
iii. There exists a neighborhood N of the positive real axis and a T > 0 
such that Efe “*'?! ~ Ofors <¢ N andt> T. 
Hypothesis iii will only be needed when the system is recurrent-null; 
i.e., the service-factor is unity. 


Ill. SUMMARY 


Basic assumptions and preliminary lemmata occupy Sections IL and 
IV, respectively, the principal result being the existence of a solution 
s( 7) of the functional equation 

T= 8(T) Qa. S(T) ), 
where, for real s = 0, 
. 1 ’ sK , 
a(s) lim sup t log E}« "| W(0) 0}. 
t +X 
This functional equation is closely analogous to the “fundamental equa- 
tion” of branching processes. In Section V it is shown that the Laplace 
transform of Pr} W(- ) O} is just 1/s(7), whence it follows (Section VI) 


that the asymptotic value of Pr} W(¢) O} as t-> % (if any) can be de- 


termined from the limit of a(s)/s as s > 0. 

The existence of a limiting value Pr} W( « ) 0} is investigated in 
Sections VII and VIII, by Mercerian and Tauberian methods respec- 
tively, applied to (5), for w = 0. Section LX contains limit theorems for 
Pri W(-) < wh, w > 0; these are obtained readily by Abelian methods 
from (4) or (6), once the convergence of Pr} W(t) 0! ast— « has 


been ascertained. 


IV. PRELIMINARY RESULTS 


Let s = 0, and define a(s) to be the abscissa of convergence of the 
Laplace transform* 


WV (7,8) ( "Eh ia t dt. 


Obviously a(Q) 0,a(s) S Ofor s = 0, and a(-) is monotone decreas- 
ing ins 2 0. It is not obvious that a(s) # — *, at present. 


*We assume throughout that Eje-**(} is a measurable function of ¢ for s 
values under discussion 
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Lemma 1: The abscissa a(s) of convergence of the integral 


sk 


"t dt 


W(7.s) 


is given by the formula 
; 1 8K 
a(s) = lim sup t log E}e s 20 
[+x 


K(t)) 


Proof: Let s > 0 be fixed and set Eje*™ |} = y(t). The function 
¥(-) is monotone decreasing, and satisfies 0 S y S 1. Hence, by The- 
orem 2.4d of Widder, the abscissa of convergence of the integral 


“x 


i ( dt) 


lim sup ¢ ' log | y(t) | S 0. 


However, 
“7 


[ ¢ “ dwit "y T) + r | ( “y(t) dt, 


0 “0 


so that (11) and (12) have the same abscissa of convergence. 
Lemma 2: For s,h > Oand u = O 


DAY . 1 BS 6 S BN iB 


Proof: Let K(u) E. Since 2°*" is convex in x" and in 2’, it follows 


S 


from Jensen’s theorem that 


AY 


Multiplying these inequalities ‘ther gives the result. 


Lemma 3: If 


lim sup FE} A(u)/ul < B, 


L 


then, given « > 0, there exists A so small and ¢ so large that 


forall u > t 
Proof: Choose h so that h8 < ¢, and t so that 


E\K(u)} < Bu 
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for all wu > ¢. Then, by Jensen’s theorem, for u > ¢, 


’ hK(u . n Et 
a) t > 
he > 


Lemma 4: Uf 


lim sup HE} A(u)/uj} < x, 
t+ 
then for given « > 0 and any s > 0, there exist h and ¢ so that 


1 , sK(1 1 ’ a+h) K(u J 
t log E}e “t—t log Efe | << 


forall u > ¢. 
Proof: Lemmata 2 and 3. 
Lemma 5: a(-) is a continuous function of s,s 2 0. 
Proof: Set t' log Efe **°?} q.(t). Let « > 0 be given, and find, by 


Lemma 4, an h and af so that u > ¢ implies 
OS gu) — gepnlu) S 


There exists a sequence wu, > along which 


€ \ 
a\s) — s O\.Ua! , 


€ 


Jern(,) S Pt A) +s 


for all n sufficiently large; this is because 


a(s) lim sup g,(¢). 


t+x 


For such n, then, since a(-) is monotone decreasing and (13) holds, 
0S als) — als + kh) SB gun) — % 


Osa(s)—als th) S « 


; 


so that a(-) is right-continuous. A mirror image of this argument proves 
a(-) left-continuous. 

Lemma 6: a(+) is Convex in s = 0. 

Proof: For each t > 0, Efe “““°} is a completely monotonic function 
of s, and so (Widder,” p. 167) is logarithmically convex; indeed, in 
s = 0 we have 

, log Efe 
t as- 
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This implies that for s,h 2 0 
h) = gi(s + $h). 


| 
+ ? 


Sgius) + 3gr(s8 
Taking the lim sup as t > ~, we find that a(-) is convex. 
Lemma 7: For s,h > 0, 

aris + h) 


sth 


and 


Ss 


5 ars 
—lim > 0 


exists as a finite or infinite limit. 
Proof: If — is a positive variate then 


EN'{e'} 


(See Loéve,” p. 156 Choosing gE 


is a nondecreasing function of 7 


mast 
. we have 


f 


lim sup g: 


x 


14) is true. 


sO) 
Theorem 1: If 

lim inf ¢ | log Pr} K(t) i S-— 0 

then for each r => O there is at least one number s(7) such that 

a(s 


iors 3 ; (17) 


for s 
a constant ¢ > O such that 


? 
Is 


] 


Proof: The hypothesis implies that there 


Pri A(t) = O} = « 


for all sufficiently large ¢. However, this implies that 


t 
> 


Bye Pe 
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for all ¢ large enough, and so a(s) = — ¢ for all s 2 O. Since a(-) is 
continuous it must intersect the line 7 — s at least once, and the rela- 
tions (17) follow from the convexity of a(- 

This section ends with some preliminary analytical results for F(- 
and P(-). 

Lemma 8: If R(-) is of bounded variation in every finite interval, 


then 
F’(t) d/dt F(t) 


exists almost everywhere. 
Proof: The hypothesis implies that the transform 


“x 


p(s) e dR(t) 


exists in Re(s) > 0. From (10) we find 

ols) p(s) | e P(t) dt p(s)II(s), 
but II(-) is the Laplace-Stieltjes transform of an absolutely continuous 
function; hence g( +) is also. 

In order to use various Tauberian theorems it will usually be neces- 
sary to impose ‘Tauberian” conditions on the oscillations of ?(-). We 
recall (Widder,’ p. 209) that a function f(-) is slowly oscillating if 

lim f(y) — f(r) (0, 


y—z-0) 
j» 


and f(-) is slowly decreasing if 


lim inf f(y) — fir) = 0 


y—r+0 


asr— C,y BCEy oo. 
Lemma 9: If F’ is continuous uniformly in ¢ 2 0, and R(-) is of 
bounded total variation, with the form 


R=U+H5,+ 


where U is the unit step at zero, H, is absolutely continuous, and 


| dH 


then P(-) is slowly oscillating. 
Proof :* Since F’ exists we may differentiate (10) to find that 


* The procedure of this proof is patterned after that of Karlin.? 
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"(t) | P(t —u) dR(u). 


This is equivalent to the renewal equation 
P(t) F’(t) 4 | Pit —u)d,{U — R}, 
or, iterating n times, 


Pa= |] Pa-wdaih+My+] Pu-—w dN), 


“0 


where L is absolutely continuous, 


| dM | < \", and | dN | < ao. 


Then 


P(t+ 6) — P(t) < | J Z 4 dt +X 


AN | u) 


tte—u) —F(t—u)|\dN(u) 


The first term goes to zero with « — 0, by a known result of Lebesgue 
(Wiener, p. 14). The second term vanishes as n increases. The third 
term approaches zero as t > ~, since F’ is bounded. The fourth term 
goes to zero with e — 0 by the uniform continuity of F’.* Hence 

lim | P(t + «) — P(t) | = 0. 

tox 
The hypothesis that F’ is continuous can be replaced by the condition 
that F’(t) approach a limit as ¢ > x. Also the uniform continuity of 
F’ could be replaced by a weaker but more complicated asymptotic 


condition. 
V. THE TRANSFORM II(-) oF Pr}W(-) = 0} 


Theorem 2: Let s be real and positive, and let 


ee 


e P(t) dt II(7); 


* It is assumed that F’ (t) = 0 fort < 0 
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then (16) implies 
sIl(s + a(s)) 


Proof: Equation (6) implies that 


nx 


| eRe?) at W(r,s)|[1 — sIl(r)], 


The left-hand side has no singularities in Re(7) > 0. But by Widder” 
(p. 58, Theorem 5b), W(7,s) is singular at + s + a(s), and so (18) 
holds. 

Theorem 3: If r > 0, and (16) holds, then 


l 


S(T) 


II(7) 


where s(7r) satisfies (and is determined uniquely by) the equation 
T— S(T) at s(T7)). (19) 


Proof: For given 7, there exists at least one s(7) > O satisfying (19). 
By Theorem 2, any such s(7) has the property 


I 


li(s(r) + als(r))) 


II( 7), 


because of (18). Since II(7) is a strictly monotone function of real rz, 
being of the form 


fe 30 


II(7) | e P(t) dt, P(-) 26. 
the solution s(7) of (19) is unique, and strictly monotone increasing 
In 7. 
When —a(s)/s — 1 as s — 0, we need some additional properties of 
a(-). For each ¢, there is a neighborhood N, of the positive real axis in 
which 


has no zeros. However, we shall need the condition ii of Section IT that 

there exist one neighborhood N of the positive real axis in which 
Eje*"'"} #0 

for all ¢ sufficiently large, say ¢ > 7. Define, for 2 


l ’ Kit 
g-(t) t log 4} ¢ Balke 
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Then g;.)(t) is harmonic in N for t > T, and 
|g(t) | = |¢ log 
< ("| log Pr{K(t) = 0} | < @, 
so that | g.(t) | S efor z © N, where ¢ is the constant of Theorem 1. 


Theorem 4: If 


lim inf ¢' log Pr{K(t) = 0} > —« (16) 
t+x 
and N is a neighborhood of the positive real axis such that 
AY, + gt). 2 N 

for all sufficiently large ¢, then a( 
in NV. 

Proof: For t > T define a class V; of functions on N as follows: V;, is 
the smallest class containing all g,.)(t’) for > ¢, and closed under the 
operations 


-) can be extended to be harmonic 


a » Ve > Max (Vy, Ue) 
outside A 
Pi inside A 
where A is a dise with A C N, and P is the Poisson integral operator on 
the dise A. These operations preserve the property of being bounded by 
the constant ¢ of Theorem 1. Now let 


sup v(2). 
\ 


By a standard argument (Ahlfors,” p. 197), the function u, 
monic in NV, and clearly 


-) is har- 


= sup g.(h 
We prove the reverse inequality: 
u(z) S sup g(t 
os 3 


[It is enough to show that the operations (a) and (b) preserve this prop- 
erty. For (a) this is obvious; for (b), we observe that 


l 8 
| wz2+re)d0s sup g.(f)). 
2n a ba >t 
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Hence for real z 


a(z) = inf sup g.(t), 
t t,>t 


lim u;(z). 


t+% 


The functions u,(-) are harmonic and monotone decreasing in ¢. Hence 


by Harnack’s principle they either tend to — ~ or to a harmonic fune- 
tion. The first alternative is ruled out by the inequality | g.(t) | < e. 
Hence a(-) is harmonic in NV. 
Theorem 5: Under the conditions of Theorem 4, let 
ais) 


lim — 
8s Ss 


If p > 1, there exists a largest root ¢ > O of the equation ¢ = a(¢), and 
s(r) > fas7r—>0. If p S 1, then s(r >Oas7r— 0. 
Proof: s(-) is monotone, nonnegative, and nonincreasing, so s(0+ ) = 
0 exists. If p > 1, then a(x) < —.x for some x > 0. However, for + > 0, 
a(s(r)) T S8(r) > —s(7T), 


so s(0+) > x > 0. Since s(-) is continuous we may let 7+ — 0 to find 
¢ a(¢) s(O+) > 0. 
Next, if p < 1, suppose that s(0+) > 0. Then 


a(s(QO+ )) 


—|, 
s(O+ ) 


which is impossible, since 


ats) 
>—p > —] 


Ss 


in a monotone decreasing manner. 


~ 


The case p = | requires a special argument. If s(0+) > O, then again 
a(s(Q+ )) 


s(O+ ) 


—| 
and we must have 
:. Oss S s(O+). 
s 


However, since a(-) is harmonic in a neighborhood N of the positive 
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real axis, this would imply, by the Cauchy-Riemann relations, that 
a(s) = s throughout N; but a(s) = —c for real s. Hence s(0+) = 0. 
The hypothesis about the existence of the neighborhood N is needed 


only for the case p = 1. 
Theorem 6: Suppose that P(«) = lim P(t) ast— ~ exists. Then 
0 fe 2, 
Pi Dm) 


l1—p ifp <1. 
Proof: Since r — s(7r) = a(s(7)). we find that 


T ats(t)) 
rII(7) =1- 
Si FT } 8( T ) 
Then if p < 1, the result follows from the previous theorem and a stand- 
ard Abelian theorem for the Laplace transform. If p > 1 then P(-) © L, 


and if P(x ) exists it must be zero. 


VI. ASYMPTOTIC RELATIONSHIPS 
We now prove some lemmata that exhibit some of the basic asympto- 
tic relationships between p, a(-), F(-), R(-), ete. 


Lemma 10: 


l—p 1 +a (0) 2 lim sup 1 — FE} K(t)/th. 


lt» 


Proof: By Jensen’s theorem 


SES K(t)/t}. 


Definition : 


mx 


Q(t Pri k(t) > ut du SimaxlO,A(t) — ¢]! 


Lemma 11: If Q(t) (t)ast— x, then 

ie ee 

i. If d/dt F} K(t)} also exists and approaches a limit as f — ~«, and 
rQ(-) can be differentiated in the usual way, then R(t) —~ last— «. 

Proof: 


Os CPi) = 1 


and 
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so that 
lim inf 1 — ES AK(t)/t. 


t»% 


lim inf ¢ F(t) 
t +2 
Then i follows by the previous lemma. To prove ii note that 
ad) 
” (t) Pri/K(t) > u} du. 


( t 


| 4 BEikK(t); + 1— R(t) + | 
It : 


The last term is positive, so 


d 


Rit)=i1il+1 7 E\K(t)! - F’(t). 


R(t) — 1. 
< 


Lemma 12: If R(t) — R(« ) QOast— «, then p S 1 
Proof: 
min[0,t—K(t)]) 


= E\e° ty 


=> Pri A(t) S tf. 


Hence also 


a(s) er 
=- limsupt log R(t). 


s >) t»x 


If R( ©) > 0, the lim sup is zero, so that 


ats) 


1+ = 0, 


s 


Lemma 13: If 


lim sup ¢ log R(t) 0, 


{+x 


If this lim sup is less than 0, then R(-) is integrable 


then p S 1. 
0 and p 


and R( = ) 


Lemma 14: If 


a 


0. Hence also P( = ) 


lim inf Q(¢) (0, 


t 


then p S |. 
Proof: 
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E{min [0,t — K(t)]} = Ejt — K(t)} — F(d), 


E{K(t)} : 
> lim sup | _ — rio], 


= lim sup [—Q(4)], 


=> — lim inf Q(?). 


t+x 


VII. CONVERGENCE OF Pr} W(t) Q!: MERCERIAN METHODS 


A Mercerian theorem (see Pitt,” p. 94) is one which, for example, 
enables us to study the asymptotic behavior of P(-) directly from that 
of the convolution 


F(t) = | P(t — w)R(u) du, 


i.e., that of F(-), without the intervention of ‘*Tauberian’’ hypotheses 
on the oscillations of P(-). Tauberian methods usually require P(- ) 
to be a slowly decreasing function; such methods are considered in 
Section VIII. 

In what follows the norm symbol (when used) refers to the total vari- 
ation of a function of bounded total variation. The subscripts ‘dis’? and 
“sing” (applied to a function symbol) denote the discontinuous and 
singular components, respectively. 

Theorem 7: lf R(-) is of bounded variation in any finite interval, and 


for some o = 0, 


inf | p(o — ir), > 0, 
(R, sing . inf Pdis\ o 


then P(-) can be represented by the inversion formula 


| pe’ t{—«ae dG ). dG, 
provided that 
Belt | ee" AR(u 


x 


is of bounded total variation. 
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Proof: The hypotheses (20) through (22), together with Theorem | 
. ve » 8s ° . ° 7 . 
of Wiener and Pitt, imply that there exists a function G,(-) of bounded 
total variation whose Fourier-Stieltjes transform is 


(o(o — ir], o fixed. 


We then solve the equation 


ot 
e F(t) | eo ™ P(t — ude" Ru) du 


“0 


Laplace transforms, obtaining 


60 ( ? 
te AD ¢) 
| ri ‘P(t) dt al ‘ 
Jo p(p + a) 
However, by Lemma 8, F’ exists and has Laplace transform ¢(- ). There- 
fore 


( P(t) 


oe, ot > 
and division by e ” completes the proof. 


This result provides another way of proving that P’(-) is slowly oseil- 
lating, as follows: 
Lemma 15: If the hypotheses of Theorem 7 are true with « = 0 


F’(-) is uniformly continuous and bounded, then P( - 
lating. 


, and if 
is slowly oscil- 


Proof: Let b be a bound on F’. Then 


P(t + 6) — P(t) | — u)|| dGo 


“7 
dGo(u) | + | rU+e= 
“—T 
| AGy(u) 
Choose first T large, then ¢ small, using the uniform continuity of F’ 
to let t— x. 


Theorem 8: If the hypotheses of Theorem 7 are true with « = 0, if 
F’ is bounded below, and for some constant A 


g(s) ~ : ss 04+, 





1298 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1961 


then 


A 
P(t) - ast— x, 
c) R(«) 
Proof: The hypotheses imply that P(-) can be written : 


P(u) | F’(u — y) dGo(y) 


x 


with G»(-) of bounded total variation; then also 


. t tp — yee 
P | P(u) du | 4 Z 4 dAGy(u 


Jo Jo t{—uUu 


There exists a constant 6 > O such that 
F(t) + bt 


is nondecreasing in ¢ 2 0; hence by a known Tauberian result ( Widder,” 
p. 197, Theorem 4.6), condition (23) implies that 
F(t) 


~>A20 as lt 
t 
the convergence being bounded, since 0 < F(t) S ¢. It follows that 


r P(u) du — A[Gy(+ x) — Gy —- = )] 


“0 


A 
R(x 


> 


The condition (20) of Theorem 7 guarantees that R(x) > 0. 
Theorem 9: If the hypotheses of Theorem 7 are true with o = 0, and 
F’ — F’( x) boundedly as t > =, then 


/ 


GA hd, 


Ri x) 


P(t) 


Proof: The hypotheses imply that 


P(t) | F’(t — u) dGolu 


x 


where Go(-) is of bounded total variation. The result then follows from 


. >a . 12 
a known Abelian lemma, e.g., Lemma | of Smith. 
Theorem 10: If the function R(-) — U 


-) is absolutely continuous, 
if F’( x ) exists, and if p(s 


) has no zeros in Re(s) = O, then 
F’( x 

P(t) — as t 
Ri xm ) 
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Proof: Set [R(-) — UC-)/ k(-), so that (10) implies (by differ- 
entiation) that 


F’(t) = / P(t — u) dR(u) = P(t) + / P(t — u)d,(R — U) 


) 0 


P(t) + P(t — u)k(u) du. 


Then, ast— x, 


P(t) + [ P(t — w)k(a) du — F’( x). 


By Theorem XVII of Paley and Wiener,” this, together with the con- 
ditions that P(-) be bounded and p(s) # 0 in Re(s) = 0, implies that, 


ast—- « 


F’'( @) F’( 2) 


P(t) — a 
1+ | ku) du 


R(x) 


since for t > 0 


R(t) 1 + | k(u) du. 
By a theorem of Pitt'’ (p. 115), the restriction that p(s) + 0 for 
Re(s) 2 0 can be weakened to p(ir) # 0 for real r. 


VII. CONVERGENCE OF Pr} W(t) QO}: TAUBERIAN METHODS 


The addition of ‘“Tauberian” conditions on P( - ),* 


such as the prop- 
erty of slow decrease, makes it possible to change or weaken the hy- 
potheses on R(-) and F(-) necessary to ensure the convergence of P( - ). 
The next result shows how the convergence of F’(-) in Theorem 9 ean 
be relaxed. 

Theorem 11: It 

i. For some integrable function &(-) with a nonvanishing Fourier 
transform A(-), and some number A, 


at x 


| k(t — u) dF(u) ~ A | kia) du ast— »: 


0 v x 
ii. P(-) is slowly decreasing; 
ii. R(-) is of bounded total variation with 


* Such conditions were briefly studied in Section IV 
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p(—tr) € 0, 


Reire || < inf | pais( —27 |: 


T 


P(t) 


R(x 


Proof: The conditions on p(-) imply, by Theorem 1 of Wiener and 
. ll 
Pitt,” that 
K(r) 
p( —ir) 
is the nonvanishing Fourier transform of an integrable function .(- ) 


such that 


“x 


| rit — w)P(u) du. 

The theorem to be proved then follows from Pitt’s form of Wiener’s 
fundamental Tauberian theorem (see Theorem 10a, p. 211 of Widder’). 

Instead of imposing conditions on F(-) and R(-), one can place them 
on the Laplace-Stieltjes transforms ¢(s) and p(s). We shall use the 
following Tauberian result: 

Theorem 12: Let y(s) be the Laplace transform, convergent in Re(s) > 
0, of a bounded function C(-) 2 0, vanishing for negative argument, 
and slowly decreasing; i.e., 

lim inf C(t 


el) 


t+x% 
Suppose that, for some function g(-), constant c, and s 


lim y(s) — 8s ¢ g( 7) 


o-» 


exists uniformly in every finite interval, —a S +r S +a. Then 
C(t ast— o. 
Proof: This is a variant of Ikehara’s theorem, Widder’ 
Define, for each \ > 0, 


= 2A, 


Ky L 
co ae 


mi\. 
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This is the Fourier transform of the function 


ky (v2) = 2X(2n) ; (= m) ‘ 
AX 


I(x) = (29) ; [ ky(a — w)[C(u) — cle ™ du 


lor « > O, set 


| sa 2A 
[ [(C(u) — cle ™“ du [ Ky(yje "dy 
2r “0 “—2ar 


»2 *e 
t 


l , iya Pal 
| Ky\(ye ™ dy [IC(u) — ele 
2 2h sah 


oT - 0 


Qa 


S ap" ive ; : 
z Ky(ye lye + ty) — Ce + ty) ‘e| dy. 


aT “—2r 


By (24) we may take the limit under the integral as « — 0, to obtain 


“2D 
° 1 > rh 
lim J,(2) = (29) | K(x )e ‘al y) dy 
7—2n 


€ (4 
"x 


| k(x — wlC(w) — e| du, 


the last identity following from Widder’ (p. 183, Corollary le). The 
function Ay(-)g(-) belongs to L,, and so, by the Riemann-Lebesgue 
lemma, 


“x 
(2x) ° kylx — wlClu) — eldu— 0 


“0 


Since C(-) is bounded and slowly decreasing, we conclude from Wid- 
der’ (p. 209, Theorem 9), that 


Ct 2) c asu—- @, 


The theorem just proved implies directly the following consequence 
Corollary: Suppose that 
i. For some number L and function g(- ) 


p(s) I 
¢ y) 
— — gr) as ao — QO, 
p(s) 8 
uniformly in every finite interval; 
i. P(-) is slowly decreasing. 
Then P(t) > Last— x. 
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Some simple conditions on F(-) and R(-) which ensure that  hy- 
pothesis i of the corollary obtains are described in the next result: 

Theorem 13: If 

i. For some number LZ the function 


at 
F(t) — L | R(u) du 
“0 
(assumed to vanish for t < 0) is of bounded total variation: 


ii. R(-) is of bounded total variation, and 


p(—tr) € 0, Rsing |) < inf | pais( —t7) 


then there exists a u(-) such that 


gis) f, 
= vA &) > val T) > Q 
p\ Ss) S 
uniformly in 7, and yw( 7) is the Fourier-Stieltjes transform of a function 
of bounded total variation. 
Proof: The Laplace-Stieltjes transform of (27) is 


L 


Ss 


p(s), Re(s) = 0. 


The condition ii ensures, by the Wiener-Pitt theorem, that [p( —ir)] ' 
is the Fourier-Stieltjes transform of a function of bounded total varia- 
tion. It follows that u( —77) is also. But u(-) is the transform of P(-) — 
L, and so extends analytically into Re(s) > 0 with the representation 


ex 


| ( ‘dG(u), G < @, 
Then 


dG(w) 


dG(u) | + [ dG 
/7 


Let « > 0 be given, and pick 7 so large that the second term on the 
right is less than €/2; then pick o so small that the first term is also. 
This proves the result. 

The condition i of Theorem 13 is satisfied, for example, if F’( < ) 
exists and both of 
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F’( a2) — F(t) 
and 
R(o) — R(t) 


are absolutely integrable, since Lemma 8 showed that F’(t) exists as 
soon as R(-) is of bounded variation. These facts suggest that, if (27) 
is true, then an elementary argument can be used to show that P(t) 
— L, and use of the complex Tauberian theorem can be avoided, as 
follows: 

Theorem 14: If 

et 
i. F(t) — L | R(u)du is of bounded total variation; 


“vu 


ii. p(—ir) # Oand | Rying |) < inf | pais( —tr) |; 


ili. P(-) is slowly decreasing; 
then P(t) ~ Last— «. 

Proof: By Lemma 1, F’(-) exists, and so by the arguments used in 
proving Theorem 13, u(s) is the Laplace transform of P(-) — L, with 


i 
| P(u) — Lidu < o. (28) 
Now suppose that P(u) — L > e for arbitrarily high values of u. Since 
P(-) is slowly decreasing, this implies that P(u) — L > €/2 through- 
out infinitely many nonoverlapping intervals (y — ¢, y + ¢) on the 
u-axis. This contradicts (28). A similar argument shows that P(w) — 
L < e cannot be true for arbitrarily large u. Thus P(t) — Last—- «. 
Heretofore in this paper we have relied heavily on the condition 
p(—ir) # 0 which, if R(-) is of bounded total variation, implies that 
lim Pr{K(t) < t} > 0. 
tox 
We now consider the case R( = ) 0; not attempting an exhaustive 
analysis, we content ourselves with the following result : 
Theorem 15: if 
i. R(-) is of bounded total variation with R( < ) 0: 
ii. p( —ir) # O for real r ¥ 0; 


er+T 
P(a) du 


saat Ba l 
lim lim sup oT 
»X — © 


T +X ‘ 


sf 


F’( 2 ) exists; 
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then 


Proof: 


E{max [0,1 — ('K(t)]}, 


(1 — ¢'K(t)|P(do), 


aie 
= Pr{K(t) s t} = R(t). 


Hence F(t) = o(t) ast > ~ and F’(x) = 0. Equation (10) implies 
that 


F’(t) P(t — u) dR(u) - 0 as ft 


and also that F’(-) is bounded. Let k(-) be any function of L, whose 
Fourier transform does not vanish. Then 


exe 


| k(t — w)F (ua) du | y(t — u)P(au) du. 
where y(-) is the convolution 


px 


y(t) k(t — u) dR), 


and belongs to L;, with a nonvanishing Fourier transform. The inter- 
changes in orders of integration can be justified by Fubini’s theorem. 
By a standard Abelian result (for example, Lemma 1 of Smith'’), the 
left side of (29) tends to zero with increasing ¢t. We can now use an ex- 
tension of Wiener’s general Tauberian theorem due to Pitt" (Theorems 
3, 5, 6) to conclude that P(t) — 0 as t > 0. 

The chief practical difficulty with the theorem just proven is the 
verification of the condition iii. 


IX. LIMIT THEOREMS FOR Pr} W(t) 

Once the limiting behavior of P(t) as t — *% has been determined, 
one can use Abelian arguments on (4) and (6) to prove limit theorems 
for the distribution of W(- 

Theorem 16: If 


: £(f) Pr} W(t) 0} — Pri W(o ) > thas 
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oo » . . ’ K(u) uu 
il. for each + # O, the function Ei} e'! “)) belongs to L;(0,  ) 
and vanishes at * (as a function of w); 


ili. the function 
A(r) = Pr {W(-) 
is continuous at 7 = QO, with A(0) 
then there is a distribution function 
BC+) = PrW Cao) Ss « 


such that 


. ’ irWit 
lim [fe 1 = X(?r), 


t+® 


lim Pri W(t) < w! Liw), 


k+»x 


at continuity points of L(-), at least. 
Proof: A direct application of Lemma 1 of Smith’ 


to the real and 
imaginary parts of (6) with s = —ir, 7 real, proves (31). The rest is 
a consequence of the standard continuity theorem for characteristic 
functions. Formula (30) is a generalization of the Pollaezek-Khinchin 
formula; this may be verified by letting A(-) be a compound Poisson 
process. 

If Pr} Wi «) = O} = O, and condition ii of Theorem 16 is fulfilled, 


then a similar argument shows that 
’ it W 
Eie"™""! +0 ast— « 


for all + # 0, so no limiting distribution exists. 

Alternatively, one can apply similar Abelian methods directly to for- 
mula (3). For convenience we shall use, instead of (3), the w-integrated 
form (w = 0) 


pu et+ 


Pr {W(t) S ub du = | Pr {K(t) S u} du 


— | Rit — uw) Pr |W (a) O} du, 
in which R(t,u,w) has been replaced by R(t — u,w), in accordance with 
the stationarity condition (7). 
Theorem 17: If 
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i. For each w > 0 


/ [Riuw) — R(u0}|du = r(w) < @; 
i. R(t) ~last—- ~; 
ii. P(t) = Pr{ W(t) 0} — Pr{W(o) = Oi ast ~; 
then, for w > 0, 


lim [ Pr {W(t) Ss u} du = w — r(w) Pr {W(o) = 0}. 
Proof: By subtracting the case w = 0 of (4) from the case w > 0, 
we find 


| Pr {W(t) s wu} du Pr {K(t) S u} du 


“oO 


[R(t — uw) R(t — uO)|PCu) du. 


> ?t, then 
Pri A(t) S uf 2 Pri A(t) 
Hence, by ii, 
lim [ a Pr {K(t) S u} du = w. 
tox “t 
By i. and Lemma 1 of Smith,” the last term of (32) approaches 
r(w) Pr{W( 20) = O}. 
To clarify the meaning of the condition i of Theorem 17, we note 
that 
(tw) — R(t0) = Pri0 < K(t) —t S wh, 
r(w E\amount of time that 0 < K(t) — < wi. 


Other limit theorems can readily be obtained, e.g., for the nonintegrated 
equation (3), as soon as suitable differentiability conditions are imposed. 
Since no new principle is involved, we shall leave the matter here. 
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A Nonlinear Integral Equation from 


the Theory of Servomechanisms 


By V. E. BENES 


(Manuscript received March 28, 1961) 


The equation x(t) = s(t) — k*F(x)(t), where s(-) is a given signal, 
F(-) ts a nonlinear function, k(-) is the response of a linear system, and 
* denotes convolution, describes a general class of servomechanisms. Prop- 
erties of a solution x(-) can be established by finding a fixed point in a 


specific set of a function space, using Schauder’s theorem. 


I. INTRODUCTION 


A general class of nonlinear servomechanisms is described by the 
integral equation 


et) =-att) — | k(t — w)F(xr(u)) du, eed 


where s(-) is an input signal, &(-) is an impulse response function, and 
F(-) is a nonlinear function. The equation (1) represents the system 
diagram of Fig. 1, with F(-) as above, and with A(-) the transfer fune- 
tion corresponding to k(-). We assume that F(-) satisfies the uniform 
Lipschitz condition 


|F(a) — Fiy)| SBlr—- yi, 


and that F(0O) = 0. 

A classical method for studying nonlinear servomechanisms like that 
of Fig. 1 is to specify exactly the nonlinear element F(-), to assume that 
the response k(-) is the Green’s function of a differential operator of low 
order, and to use some sort of phase-plane analysis. This method has two 
theoretical disadvantages: it lacks generality, and, when applied, it 
tends to give more information than is needed; thus it provides detailed 
knowledge about a restricted class of cases. 

In this paper we shall use a method that has the opposite characteris- 
ties: it provides a small amount of highly relevant information about a 
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large class of cases. We shall exemplify the use of Schauder’s fixed point 
theorem for studying solutions x(-) of (1) without specifying either k(- ) 
or F(-) in detail. We establish definite properties of x(-) by finding a 
fixed point (corresponding to a solution of the equation) in a specific 
set of a function space. Since the function space and the set can be chosen 
in many ways, depending in part on what properties of a(-) are of 
interest, such a method can be used for a wide class of problems. The 
theory in the sequel is therefore restricted to sample results for the 
function space Ly of square-integrable functions, and is to be regarded 
only as a particular example of the method described above. 


II. FUNCTIONS OF FINITE ENERGY 


In many situations it is desirable that the convolution term 


ax 


| k(t — u)F(a(u)) du 

© x 
follow the input signal s(- ). The error in this approximation is then x(- ) 
itself. It is then reasonable to work in the space Le of real, square- 
integrable functions, i.e., functions of finite energy. Accordingly, we 
assume that k(-) and s(-) are in Le , and we seek to bound the energy 
of a solution x(-) of (1). 

Now the functions of L. cannot assume values appreciably different 
from zero on sets of arbitrarily large measure. Hence they may be viewed 
physically as pulses. By restricting s(-) and the solution 2(-) to Le 
we are therefore studying the response of the system of Fig. | to certain 
pulses of finite energy. We shall be particularly interested in finding out 
how much of the energy of x(-) lies outside a given time interval. 


The norm symbol - | is used to denote the square root of the energy 
of a function. Thus for x(-) in Le , 


(/ x(t) P at) : 


and a sequence of functions |.c,(-),m 2 O} is said to converge to a fune- 


tion 2(-) in Le-norm if | « — 2x, | approaches zero with increasing n. 


Iil. HYPOTHESES AND PRELIMINARY RESULTS 


If x(-) isa funetion of Le , we let 


rx 


Tx(w) (Da) ig e*'r(t) dt 
4 





NONLINEAR INTEGRAL EQUATION 


NONLINEAR LINEAR 
FUNCTION SYSTEM 


| 


| | 
5 FC) K+) 
“ [| 


a ee Cee 


sit) +> x(t) 
“\ 


_—> 





to 








Fig. 1 — System diagram of servomechanism. 


denote its Fourier transform; for k(-) © Le , we reserve the special nota- 
tion 
Tk(w) = K(w). 


The operator H on Ly» is defined by the condition 


Hx(t) = | k(t — u)Flalu)) du. 


Lemma 1: If K(-) is bounded in w, then H is a continuous transforma- 
tion of Le into itself. 

Proof: For x(-) L, the Lipschitz condition on F(-) yields 

F(x) | SB! ax/!, so that F(x(-)) © Le. It isa known result that the 
convolution of two Ly functions belongs to Le. Hence Hx Le. Also, 
by the Parseval relations, 


Hx — Hy|\ = | K(w) |° | TF(x) — TF(y) |? da 


. x 


< sup | K(w) |’ || F(x) — Fly) 


w 


< psup  K(w) 


which shows that / is continuous. 
Now let w(-) be a given non-negative function of Ls , and let S be the 
set of all w(-) in Le such that 


x(t) | S w(t), almost everywhere. 


Lemma 2: S is closed and convex. 
Proof: Let x,(-) © S be a sequence of functions approaching x(-) in 
Ly. Then for ¢« > O and u(- ) Lebesgue measure, 
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However, 
x(t) — Xn(t) | + | tal b) 


| 


a(t) — z,(t) | + wit). 
Hence | x(t) | — w(t) > ¢€ implies | x(t) — z,(t) | > ¢ and 
pit: | a(t) — x,(t) | > ef 2 wit: | z(t) | — wlt)> ¢. 
Letting n approach infinity on the left, we find that 
{t: ;}a(t) | — wit) > e 
has measure zero for each « > 0. Hence almost everywhere 
a(t) | S w(t), 


and so S is closed. The convexity of S is obvious. 
We denote by B the subset of functions x(-) of Le which are “band- 
limited” to the frequency interval (—Q,Q), i.e., representable as 
x(t) (27) | eT x(w) dw. 
The physical interpretation of membership in B is of course that the 
sinusoidal oscillations into which a function is decomposed by the 


.e., Tx(w) = 0 for | w Q. 
The input signal s(-), and the response /( - ) will be assumed to belong 
to B. If we define the operator J on Ls by 


Jat s(t) — H(t), 


then the range of J is a subset of B. It follows that any solution of (1), 
i.e., any fixed point of J, will belong to B as long as s(-) and k(- ) do so. 
Such a ‘‘band-limiting”’ restriction is natural physically, because of the 
known attenuation at high frequencies characteristic of physical circuits, 
and it will have an important mathematical role in finding fixed points of 
J. In particular, we note that JS C B. 

To obtain a bound on the amount of energy that a solution 2(-) has 
outside a given interval, we shall suppose that the non-negative function 
w(-) of Le, used in the definition of S, satisfies the integral inequality 


s(t) | - Mt — uw) | wlu) du S wilt). 


This inequality may be thought of as defining an associated linear 
This inequality may be tl ht of lefininy ted / 
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problem; it will be used to ensure that /Jx(-) belongs to S if x(-) does. 
The nonlinear function F(-) enters formula (3) only via its Lipschitz 
constant (of order 1) B. 

Lemma 3: If (3) holds, and F(0) = 0, then JS CS. 

Proof: Let «(-) belong to S. Then 


Jz(t)| < | s(t) + | k(t — u) | | F(a(a)) | 


x 


eL 


k(t — w) | wha) du 


s(t) +8] 


x 
w(t). 


Our preliminaries are completed by 
Lemma 4: Sf) B is compact in Le . 
Proof: Let E = || w ||". The functions of SM B are (uniformly) equi- 


continuous with modulus 


20k\ sin Qe\ 
( T (1 Me 


This follows from the inequalities: 


xt +) —2(0| $ 2x) | e — 1| Tr(w) dw 


(1 — cos we) aw) 
a sin 
T Qe : 


the last bound on the right being independent of ¢ and .x(-). Also, the 


inequalities 


Tx(w) | dw 


show that the functions of SM B are uniformly bounded. 
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Since both S and B are closed sets, it suffices (to prove Lemma 4) to 
show that Sf B is sequentially compact. Let x,(-) € SAB be an arbi- 
trary sequence of functions. The x,(-) are uniformly bounded and uni- 
formly equicontinuous. By a standard diagonal argument using the 
o-compactness of the real line, we can select a subsequence x,,( +) which 
converges to a function x(-) uniformly on any compact set. We have 


TAT) | & | wt) — att) | + 1 eal?) |, 
et 


riu) | du s | rlu) — &p,(u) du 
ba t 


et 
(wa) r(u) — x,,(w) | du + | t,(u) | du. 


t 


For each fixed ¢, the first two terms on the right of the last inequality 


approach zero as m becomes large, and the third term is at most |) w |" = 
E uniformly in ¢. Hence | x | S FE and x(-) © Le. Using Minkowski’s 
inequality, we find 


(| = te \° - “«— uv, * du) 
(| r(u) |" iu) + (| Uy, U) . iu) 


+ (| ee du). 


The first two terms on the right can be made arbitrarily small by a large 
enough choice of ¢, uniformly in m; for ¢ fixed, the third term goes to zero 
as m — x. Hence the x,,(-) converge to x(-) in Le, which proves 


Lemma 4. 


IV. PRINCIPAL RESULTS FOR GENERAL F(-) 


Theorem 1: Let s(-) and k(-) belong to B, with K(-) bounded, let 
F(0) = 0, and let the integral inequality (3) obtain. Then there exists 
a solution x(-) of (1) in the set Sf B, with the properties 


r(t)| S wilt (and so |} x!) S | w 


St} 


rit) (Yar) eT x(w) dw. 
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Proof: J is a continuous mapping of the closed convex set S into its 
compact subset Sf B. By the “strong form” of Schauder’s theorem,’ ” 
there exists a point x(-) in SA B such that « = Jx. The properties 
listed above are immediate consequences of belonging to SN B. 

The following slight modification of Theorem 1 involves no new prin- 
ciple: 

Extension: Hf, in addition to the hypotheses of Theorem 1, 

Bsup | K(w) | < 1, 


then to the conclusion of Theorem | can be added 


s 


1 — Bsup | K(w) 


Proof: Let a denote the bound on the right of the last inequality. Then 
the intersection Q of S with the closed ball of radius a is closed and 
convex. With condition (4), and x(-) © Q, the inequalities 


Jax |) S| s)) + (|. K(w) |? | TF(2) |’ ds) 


S || s || + sup | A(o) | || F(z) 


<i s|\+Bsup| K(o) 


show that JQ ¢C Q/ B. Since the topology is Hausdorff, QM B is a 
closed subset of the compact set SM B, so it is compact. The result 
follows from Schauder’s theorem. 

V. PRELIMINARIES FOR F(-) NEARLY LINEAR AT THE ORIGIN 


It is clear that stronger assumptions concerning the nonlinear function 
F(-) are necessary if we are to obtain results that make the energy of 
x(-) less than that of s(-). A particularly important case is one in which 


F(t) = t+ o(t), ast—0O; 


that is, F(-) is linear near the origin. 
Let F(-) have the form [where n(k) are integers, n(1) = 1] 


F(t) = er’(-1)" fe , 
k=l 


with f, > O,f; = 1,n(k + 1) > n(k), the series converging for 
where 
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: > w/k 
lim sup | f, |” 


kx 


Suppose also that there is a number a, 0 < a < p, such that 
fork 2 1. 


Then |¢)| < a implies that F(t) has the sign of ¢ and 
F(t) —t| Sfejt|"”, (7) 


for then F(-) is represented by a power series of alternating sign whose 
terms are monotone in magnitude. 

Since we are comparing (1) to a linearized version of (1) obtained by 
setting F(t) = ¢t, we shall need the solution of the resulting linearized 
equation: this is a function y(-) defined by its Fourier transform 


Ts(w) 


Ty(w) é 
d 1+ K(w) 


Similarly, the closed-loop transfer function of the linearized loop is the 
Fourier transform 


K( w@) 


1+ K(w) 


of a function 2(-). These definitions will be justified in the theorem to be 
proved. 

By dint of our stronger assumptions on F(-), we can use a different 
integral inequality from (3). We assume instead that there exists a real 
non-negative function v( - L» such that 


px 


y(t) ee ot — z(u)| du Ss v(t). (8) 


¢ x 


With this inequality playing the role of (3), the method used to prove 


Theorem | can be applied almost without modification. 

However, since the integral inequality (8) is nonlinear in v( - ), we shall 
digress a little and give a sufficient condition for its validity. One way 
to do this is to find a non-negative v(- ) L». that satisfies (8) with 
equality, i.e., is a solution of the nonlinear equation 


‘ 


v(t) y(t) | + fo | vee'(t — uw) | zt) | du 


Mov(t) 


We shall show how the classical contraction principle for complete metric 
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spaces can be used to find a solution v(-) of (9), i.e., a fixed point of 7. 
Such a result is exemplified by 
Lemma 5: If for 


000 tint? 
6 = +f ziu)|du< « = 
) Je Z u adu< ; a bn(2) 


we have for some 6 > O 


| 
sup |y(w)| < afl — - 6, (10) 
u n(2) 


then the map J/ is contracting on the closed set Y of 2(-) Lo such 


that 
tl-) 20 


ess sup x(u) S a — 6. 


‘ 


Proof: Consider the equation for a > 0, 


sup | y(u) | + ba" (11) 


u 


The left-hand side has unity slope at the point a@ = a, and the in- 


equality (10) implies that at this point the left-hand side is less than 
the right. Hence (11) has two roots in a > 0, and, for « Y, 


po 


sup | Mer(t)| S sup | y(w)) + fo e''(t — uw) | z(u) | du 


t 


x 


<S sup / y(u)) + ba" : 


Za é. 


Thus WY Cc Y. To show that Y is a closed set we recall that convergence 
in Le implies convergence in measure. Let x, Y converge to x in Le ; 
then, asn — ~, 


xr(t)| 2 @§ 0 


gt: 1 (6) 
for each « > 0. However, almost everywhere we have 


— a(t) — a,(t) xl -. —r(t)| ta —, 


and so 


ue: x(t) < —€ < 
wit: a(t) 2a—d6 + ef) ~ 


mst: | tb) — te) | = é€. 


where u(-) denotes Lebesgue measure. Letting « — 0, we find 
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ess sup x(t) S a — 6, 
ess inf x(t) = 0. 


To show that WV is contracting on Y, let «(-) and y(-) be arbitrary 
functions in Y. Then 


Mx — My! She Tz |" | Tx** ” ae dw) 


However, on Y 


n(2)(@a — 6) ~ 


n(? 


2” ie r—- y) n(2)(a — 5) 


and so, since z 


Mz — My || S fe sup | Tz | n(2)(a@ — 8) 
< bn(2)(a — §)"° ai 
But bn(2)(@ — 6)"”' < 1, so M is contracting on Y. 
Lemma 5 implies, by the contraction principle, that there exists a 


unique solution v(-) of (9) in the set Y, obtainable as the limit of 
successive approximations starting at any point of Y. 


VI. PRINCIPAL RESULTS FOR F(-) NEARLY LINEAR AT THE ORIGIN 
Let R be the set of functions x(-) of Le that satisfy the condition 
almost everywhere, 


where v( - ) is the function in the inequality (8). The argument of Lemma 
2 shows that FR is closed and convex, and that of Lemma 4 shows that 
R11 B is compact. 

Theorem 2: If a > 0 and F(-) have the properties (5) and (6), and 
if k(-) and s(-) both belong to B, with k(- ) L, and K(w) ¥ —1, 
and if (8) holds with 


then a solution «(-) of (1) exists in B with the properties 
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a(t)| < v(t), 


Q\' 
eet Ss x 
wT 


Proof: Since K(w) is continuous, and tends to zero at %, it must be 
. re , 3,4 
bounded away from —1; hence by the Wiener-Lévy theorem,” 


[1 + K(w)| 
is the Fourier transform of an integrable function g(-), and so 
Ts(w){1 + K(w)]"' 
is the Fourier transform of a function y(-) of Lo B, and also 
K(w){1 + K(w)|] 
is the Fourier transform of a function z(-) of 1, A Le B. 
We now write (1) as 


oa] 
v(t) + / k(t — wala) du = 


ke) 
s(t) — | k(t — wIF(a(u))— xlu |} du. 


x 
Taking Fourier transforms gives 
Tx = Ty — Tz T[F(x) — oI. 


We shall therefore consider the equivalent equation 


x 
x(t) = y(t) aa | 2(t — w)|F(r(u)) — xlu)| du, 


se. Cope}. 


This is of exactly the same form as (1); in particular, G is a continuous 
map. To apply Schauder’s theorem it remains to verify that GR C R. 
For r(-) CR, 


pw 


Gu(t)| S |y(t)| + | F(a(t — u)) — x(t — uw)! | zl) | du. 
? x 
But | 2(-) | < a, by (12); so (7) gives 
F(x(t — u)) — a(t — u)| S fo| x(t — u) 
“(4 — u)|, almost everywhere. 


Hence (8) implies that | Ga(t) | S v(¢). 
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The energy of the solution of the linearized equation with the input 


‘ dw 
Jo 1+ K(w) 


The gain of the closed linearized loop at the frequency @ is 


signal s(-) is 


K(w) 


13) 
i+ K(e) — 


It is reasonable to expect that, if the function F(-) is close to being 
linear, then the solution x(-) will have an energy close to that of the 
linear solution y( - ), in the sense that, for some constant & that approaches 
unity as F(-) becomes linear, we have 


rii Selly 


A precise form of this intuitive idea, depending on the linearized loop 
gain (13), is given in 

Theorem 3: If, in addition to the hypotheses of Theorem 2, it is true 
that 
2. K(w) 
‘fe sup —— Ry 
1+ K(w) 


> 


then to the conclusion of Theorem 2 may be added 


s min ( J ‘ v ), 
l—c 


Proof: The intersection V of R which has the closed ball of radius 
y (1 — c) is closed and convex. With condition (14), and 2(- ) 
V, the inequalities 


) 


Gx | Tz\| TIF(x) — al| iw) 


+ sup | T 


show that GV C Vf B. Also, VN B is a closed subset of the compact 
set RM B. So the result follows from Schauder’s theorem. 

The condition (14) used in Theorem 3 relates the maximum gain of the 
linearized loop with the second nonzero coefficient fo in the expansion of 
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F(t) around the origin, and with the power n(2) associated with this 
coefficient. 
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The Observed 33 to 90 kme 
Attenuation of Two-Inch 


Improved Waveguide 


By A. P. KING and G. D. MANDEVILLE 


(Manuscript received February 16, 1961) 


The TE losses in long two-inch inside diameter circular waveguide lines 
were measured over the frequency range of 33 to 90 kme using shuttle-pulse 
techniques. Experimental results obtained with two types of wavequide—one 
solid-wall copper and one helix—are presented. Loss curves for both types 
are given, along with the theoretical loss curve. Statistical results on TE 
loss variations due to mode conversion in the solid wall guide are also given. 
A new method for making shuttle-pulse loss measurements with a single 
backward-wave oscillator is described. 


I, INTRODUCTION 


The conditions favorable for low TE, transmission loss in waveguide 
occur when the operating frequency is remote from cutoff and many 
other modes can consequently coexist. Two-inch inside diameter copper 
pipe can, for example, propagate 80 modes at 33 kme, nearly 300 modes 
at 60 kme, and about 700 modes at 90 kme. When so many modes can be 
propagated, the presence of imperfections in the waveguide will intro- 
duce coupling between the TE, and other modes. The spurious modes 
which are excited at such imperfections increase the TE, loss, and a 
distribution of such imperfections can produce mode conversion-re- 
conversion effects’ which cause the TE loss to fluctuate in an unpre- 
dictable manner with frequency. 

Imperfections which occur in a long line can be divided into two cate- 
gories: joints between sections and imperfections in the pipe itself. Dis- 
continuities at the joint comprise tilt, offset, and change in diameter. The 
imperfections of the pipe are of a more distributed nature and comprise 
changes in diameter, ellipticity, and the more or less random deviations 
of the axis from a straight line. It has been shown by experience that with 
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care both the joints and cross-section tolerances of the pipe can be con- 
trolled so that increases in the TE: losses from these sources are quite 
small. An analytie study and experiments by Rowe and Warters’ have 
shown that the variations in axial straightness contribute appreciably to 
increased losses and fluctuations of the TE, mode. The theory shows 
that these mode conversion-reconversion effects occur when the period of 
these variations equals the beat wavelength.* The beat wavelengths of 
the TE»; mode with TE. and with the TE,,; modes over the 33 to 90 kme 
band fall in the 1- to 5-foot range. In a waveguide, when the periods of 
components of the axial variations relative to a straight line also fall in 
the 1- to 5-foot range, the resulting interaction causes the average TEy 
loss to inerease and the actual loss to fluctuate with frequency. 

It is the purpose of this paper to present the experimental results ob- 
tained with two types of waveguide made with very precise tolerances. 
One of the lines employed conventional solid-copper-wall waveguide, the 
other was an all-helix waveguide. The measured data cover a 33 to 90 
kme frequency band. A brief description of the measuring technique 
which was employed for these measurements is included. 


Il, DESCRIPTION OF LINES 


The TE loss measurements were made on straight two-inch inside 
diameter lines laid in a straight steel conduit. The individual sections 
were about 15 feet long and were joined together with special threaded 
couplings’ which minimized offset and tilt at the joint. ““O” rings between 
the coupling and the pipe were employed to seal the line. It was impor- 
tant to pump out the line to remove the oxygen and water vapor in 
order to avoid added losses due to absorption effects.” The presence of 
atmospheric oxygen at standard pressure increases the TE» line loss by a 
factor of 20 at 60 kme. The presence of water vapor increases the losses 
to a lesser extent over the 50 to 90 kme band, but the increase rises 
rapidly with frequency above 100 kme.’ 

The solid-wall waveguide comprised oxygen-free high-conductivity 
copper of commercial manufacture to very close tolerances. The helix 
waveguide’ line (Fig. 1) was constructed in our laboratory by winding 
No. 37 wire on a precision mandrel. Over the helix winding a layer of 
glass fibers, coated with conductive material, was wound. This layer 
provided the necessary loss to attenuate the spurious modes. These ele- 
ments were bonded together and to the outer steel shell with epoxy resin. 


*The beat wavelength, Ag ,of mode 1 with mode 2 is defined as 1/A, 


1/d (1/2), where A; and A» are the guide wavelengths of the individual modes 
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Fig. 1 Helix waveguide line. 


Il. MEASURING TECHNIQUE 


The TE: losses were measured by the shuttle-pulse method (Fig. 2). 
This was accomplished by placing an iris at the input and a short at the 
far end of the section of line to be measured. The iris is a plane traverse 
metallic mesh with uniformly spaced holes which transmits only a small 
part of the incident pulse and decouples the measuring equipment from 
the line without introducing spurious modes. In the measuring equipment 
the iris was proportioned to transmit approximately 1 per cent of the 
incident energy. Thus the received signal pulses from the test section 
were reduced in level by twice this amount or about —40 db relative to 
P,. While this power loss tends to place a lower limit on the power of 
the signal source, a level in the 1 to 5 milliwatt range has been found to 
be adequate. 

The shuttle-pulse measuring technique with an iris has several attrac- 
tive features. It separates the input waveguide circuits which have large 
losses from the low loss waveguide being measured. With a line whose 
one-trip loss is a fraction of a decibel, the shuttle-pulse method permits 
the loss to accumulate for a number of trips, thus permitting an im- 
proved measurement accuracy. The results presented herein were based 
on measurements of 10 to 30 round-trips. 

The 33 to 58 kme measurements were made with a two-klystron set-up. 
The signal pulse was obtained from one of the klystrons by introducing a 
short pulse in the repeller circuit so that oscillations occurred only during 
the pulse interval. The other tube provided the continuous-wave beating 


oscillator signal. 
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Fig. 2 (a) Measuring circuit and (b) modulation scheme for shuttle-pulse 
method of measuring 7'Eo; losses. 


For the 41 to 90 kme band backward-wave oscillators were used. A 
single oscillator tube was employed in a special measuring circuit [Fig. 
2(a)] proposed by Ring’ to provide both the transmitted signal pulse and 
the receiving beating oscillator power. The modulation scheme required 
to do this is shown graphically in Fig. 2(b). Here the modulating pulse 
amplitude of a few volts is adjusted to a voltage v such that at the top 
part of the pulse the oscillator frequency is shifted to frequency fy . The 
steady-state unmodulated frequency of the oscillator is f;. The inter- 
mediate frequency is fy — fi = 70 me. Thus, after the pulse, the beam 
voltage is restored to the steady-state condition and supplies continuous 
beating oscillator signal between signal pulses. The 0.08-microsecond 
pulse width is dictated by the bandwidth of the receiving circuitry; the 
100-microsecond repetition rate permits the energy at the signal fre- 
quency fy to decay before a new pulse is generated. The beating oscillator 
power impressed upon the converter is the reflection of the forward power 
from the front face of the iris; it is higher than the pulses from the 
section of line under test by a factor of twice the iris coupling, which is 
about 40 db. This condition ensures a good signal linearity. 

Ring’s circuit is an important advance in measurement techniques. 
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The advantages are that the frequency of a backward-wave oscillator 


can be closely set and the single-tube pulse method provides more 
convenient and faster measurements. 

The waveguide components mounted between the signal source, the 
crystal converter, and the Thy wave transducer were in rectangular 
waveguide and were of more or less conventional form. The wave trans- 
ducer has been described elsewhere.” A helix waveguide mode filter was 
connected at the TE output of the transducer to minimize spurious 
mode impurities and a special taper'’ matched the transducer to the two- 
inch waveguide line. The taper section was designed to minimize con- 
version to higher-order TE», modes, especially the TE 2 mode. The iris, 
which has already been described, was placed at the input of the section 
of line to be measured. 

Three helix waveguide sections 10 to 15 feet long, which served as 
spurious-mode filters, were employed in the solid-copper test line. Two 
of these mode filters, one at each end of the line, were necessary in order 
to eliminate accumulative mode conversion-reconversion effects with sue- 
cessive trips and to ensure that each trip had the same TE, transmission 
characteristic and loss. The third mode filter was placed approximately 
in the middle of the line. Its purpose was to reduce the continuous length 
of solid-copper waveguide to half the length, from 400 to 200 feet for the 
line described. The presence of this additional mode filter had a negligible 
effect upon the average TE»: loss, but the shorter run of solid-copper 
waveguide reduced the length over which continuous mode interaction 
could occur and resulted in smaller variations in TE; loss with frequency. 


IV. EXPERIMENTAL RESULTS 


The observed data were plotted in terms of the one-way transmission 
loss as a function of frequency. The measurements were made on a 
round-trip basis and have been corrected for end losses. These include 
both the end loss and coupling loss at the iris and the loss due to the 
short at the end of the line. 

The measured data for an all-helix line, a solid-wall-copper waveguide 
line, and the theoretical loss for solid-wall copper are plotted in Fig. 3 in 
db per mile as a function of frequency. 

The smooth dashed line curve in Fig. 3 represents the loss which is 
based on the measurements of helix lines varying in length from 415 feet 
to over 1500 feet. These measured data have been extrapolated so that 
the ordinate values express the loss in db per mile. The theoretical loss 


> 


curve is also in db per mile. At the lowest frequency, 33 kme, the meas- 





1328 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1961 


uw 





b 


SOLID WALL 
COPPER 


\ 


Nw 


THEORETICAL LOSS, 
PERFECT COPPER 


TRANSMISSION LOSS IN DECIBELS 








° 





45 50 55 60 65 70 75 
FREQUENCY IN KILOMEGACYCLES PER SECOND 


w 
o 


Fig. 3 TE, loss versus frequency. 

ured loss is 3.7 db per mile; at 60 kme, the loss is 2.2 db per mile; and at 
90 kme, the highest frequency measured, the loss is 1.7 db per mile. 
At 55 kme, the helix loss is in good agreement with values determined 
on individual sections with a Q-measuring set.'’ The ratios of the meas- 
ured to the theoretical attenuation at these frequencies are 1.11, 1.67, 
and 2.3 respectively. Thus, while the measured loss of this line relative 
to the theoretical value increased with frequency, the actual loss was still 
decreasing at the highest frequency measured. It is, therefore, presumed 
that the transmission loss of the helix line will be less than 1.7 db at fre- 
quencies above 90 kmec. The smoothness of this transmission loss curve 
is the result of the continuous mode-filtering action of helix waveguide. 

The measured data obtained with a solid-wall-copper waveguide 443 
feet long are shown by the upper solid line curve of Fig. 3. The loss is 
plotted on a relative basis for reasons which are mentioned below. The 
fluctuations in the TE»; loss which occurred in this line are due to mode 
conversion-reconversion effects that resulted from imperfect waveguide 
geometry. These variations in loss were relatively small at the lower fre- 
quencies, but in the vicinity of 80 kme and above there were undesirably 
high loss peaks. 

For similar lines with the same mode filter spacing the average loss 
increases linearly with length. The results for the 448-foot line may also 


be extended by using the analysis of random imperfections’ to give the 


statistical properties of a much longer line. Thus, for a longer line with 
the same mode-filter spacing, the rms fluctuations in TK» loss will 
increase by the factor ~/n, where n represents the number of 448-foot 
sections which are required for the longer line. Both the average loss 
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TABLE | 
Frequency 
33-38 kme (41-50 kme |50-60 kme 60-70 kme |70-80 kmc 80-90 km«c 


Helix line average loss, db | 3.38 2.75 2.29 2.11 1.92 .79 
per mile 

Solid-wall copper average 3.53 2. 2.49 2.25 2.21 2.42 
loss, db per mile 

Solid-wall copper rms 0.050 ; 0.078 0.074 0.118 
variation in db for line 
1 mile long 


and the rms fluctuations of the solid wall are computed from the meas- 
ured data of Fig. 3 for a line one mile long in Table I. The average loss 
of the helix line is included. These data are for 10 kme frequency bands 
above 50 kme, and for the 33 to 38 and the 41 to 50 kme bands below 50 
kme. The data of Table I were prepared by C. W. Curry. 


V. CONCLUSIONS 


The observed TE; transmission losses over a frequency band of nearly 
3-to-1 (33 to 90 kmc) show low transmission losses. At the lower fre- 
quencies both lines show an increasing attenuation with decreasing fre- 
quency which approaches the theoretical value at 33 kme. The helix line 
shows a decreasing loss as the frequency increases, and is still decreasing 
at 90 kme, the highest frequency of measurement. Above 70 kme the 


solid-wall-copper line exhibits increasing loss with frequency. This is 


due to mode conversion-reconversion effects. Of the two lines, the helix 
waveguide has a lower attenuation and a smoother transmission charac- 
teristic. 
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The Effect of Rain and: Water Vapor on 
Sky Noise at Centimeter 
Wavelengths 


By D. C. HOGG and R. A. SEMPLAK 


(Manuscript received March 7, 1961) 


Measurements of sky noise temperature al a frequency of 6.0 kme have 
been made for various conditions of absolute humidity and precipitation. 
For an antenna beam position 5° above the horizon, the sky noise temperature 
was found to increase about 15°K from winter to summer due to the change 
in absolute humidity. During periods of rain, with the antenna beam pointed 
to the zenith, sky noise temperatures as high as 130°K have been observed, 
compared with the usual background value of 3° due to oxygen and water 
vapor. Theoretical calculations of sky noise temperature for typical dry and 
humid summer days are presented. 


[. INTRODUCTION 


This study is directed toward evaluating the variability in the noise 
level of sensitive ground-based microwave receivers to be used in radar 
or space communications.’ The sensitivity of such receivers is limited by 
the antenna noise received from the sky due to emission of radiation by 
the earth’s atmosphere and by noise sources in space. The sun and moon 
and the galaxy are serious noise contributors in space; however, since 
the sun and moon are seldom in the antenna beam and since galaxy noise 
is small at centimeter wavelengths, sky noise is due mainly to emission 
by the earth’s atmosphere. The atmosphere contains oxygen and water 
vapor, both of which absorb and therefore emit radio waves, especially 
in the microwave band. The oxygen content is relatively constant in time ; 
it therefore produces a sensibly constant background noise. However, 
the water vapor content and the noise it produces change with location 
and season. Condensed water, in the form of rain, is a strong absorber of 
microwaves and contributes a significant amount of noise, whose level 
changes with time. 

A brief theoretical treatment outlining the amount of noise to be 
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expected from the atmosphere is followed by a discussion of experimental 
data obtained at a frequency of 6 kme from a receiving site, at an eleva- 
tion of 370 feet above sea level, located at Crawford Hill, Holmdel, New 
Jersey. 


Il. THEORETICAL DISCUSSION 


The noise received from a horizontally stratified atmosphere by the 
infinitely sharp beam of an idealized antenna is given by P, kT .B, 
where k is Boltzmann’s constant, B is bandwidth, and 7, is effective sky 
temperature. If there is no precipitation, oxygen and water vapor are the 
two most significant contributors of noise, and the sky noise tempera- 
ture Is 


1, = (a, + a2)T exp| — | (a; + as) dr | dr, (1) 
“oOo “Oo 


with 7 being the actual temperature at a given point P in the atmosphere 
defined by the distance r along the antenna beam and an angle ¢, as 


shown in Fig. 1: a; is the power absorption coefficient due to oxygen a 
? 1 ae) 


point P and a is that due to water vapor, both being functions of fre- 
quency, temperature, and pressure. The oxygen absorption coefficient a 
is due to the magnetic dipole resonance of the oxygen molecule at fre- 
quencies near 60 kme. Pressure-broadening causes the skirts of these 
resonance lines to extend through the centimeter wave region, and the 
absorption contributes significantly to antenna noise. The water vapor 
absorption coefficient a, consists of two terms, sneenioen due to the 
electric dipole resonance of the water molecule near 22.5 kme and that 
due to the skirts of rotational lines in the infrared region. 

Assuming a perfectly dry atmosphere, a. = 0, and the effective sky 
noise temperature due to oxygen alone can be calculated using (1). These 
values are plotted as the lower curves of the groups of curves shown in 
lig. 2; they are based upon the temperature and pressure dependence of 
Vig. l(a). The four sets of curves in Fig. 2 correspond to various beam 
directions, ¢, of the antenna. The central curves in the figures are also 
calculated for standard temperature and pressure conditions, but in this 
case, a is not zero* since the atmosphere is assumed to contain water 
_ * In these calculations a line-broadening constant of 0.3 cm™! is used for the 
infrared lines; this choice is based on absorption measurements in the millimeter 
wavelength region. It has recently been proposed* that mutual coupling between 
O»2 and H2O molecules in air may cause additional absorption at millmeter wave 
lengths and that the broadening constant should be 0.1 em™'!, as it is for the 22.5 
kme line. This value would decrease considerably the noise temperature due to 
water vapor in the calculated curves of Fig. 2. Nevertheless, the measured data 


indicated in Fig. 6 appear to justify the use of 0.3 em”! as the line-broadening 
constant 
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Fig. 1 (a) Assumed model of standard summer atmosphere; (b) model of very 
humid summer atmosphere based on meteorological data from Newark Air Service. 


vapor of density 10 grams per cubie meter at the surface of the earth, 
which decreases linearly to zero at an altitude of 5 kilometers, as shown 
in Fig. 1(a). This condition is representative of average summer condi- 
tions in temperate zones. In computing both the lower and central 
curves, the standard value 4 was used for the normalized radio radius of 
the earth. 

The upper curves in Fig. 2 were calculated for an atmosphere of a 
typical humid summer day. This model atmosphere, shown in Fig. 1(b), 
is based upon meteorological data for August 8, 1960.* The normalized 
radio radius caleulated from these data is $ rather than 4, which means 


* Measured by the Newark Air Service at Newark Airport, Newark, New 
Jersey. 
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Fig. 2 — Typical sky noise temperatures for various antenna positions. In 


each case, lower curve is for very dry weather, center curve for average weather, 
and upper curve for very humid weather. 


that for large values of @ the antenna beam has a much longer path length 
through this atmosphere than through the one discussed above. This 
earth radius and the data of Fig. 1(b) were used in (1) to compute the 
upper curves of Fig. 2. Because of the rather extreme mcdels chosen for 
the atmospheres upon which the upper and lower curves are based, they, 
in the absence of super-refraction and condensation, represent upper and 
lower bounds on the sky noise one may expect from the atmosphere in 
the microwave band. 

Calculation of the effect of rain on sky noise temperature is difficult, in 


spite of the fact that the theoretical power absorption coefficient* is 
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known to agree well with measurements of absorption by rain in the 
microwave band. Difficulty arises because condensed water in the at- 
mosphere is not horizontally stratified. Moreover, the distribution of con- 
densed water as a function of altitude varies considerably with mete- 
orological conditions. For example, weather radar measurements have 
shown that rain may originate as high as 45,000 feet.* However, if the 
distribution of water along the path of the antenna beam were known, 
neglecting scattering by the raindrops, one would obtain its effect on 
noise temperature from (1) by substituting for a; + a, the appropriate 
a(r) for the water distribution. Thus, a measurement of sky noise is an 
integrated measure of the water content in the antenna beam. 


III, APPARATUS 


l‘or measurement of sky noise, it is desirable that the antenna and 
receiver proper have low intrinsic noise, relatively large bandwidth, and 
good stability. If these conditions are met, accurate absolute values of 
sky noise can be obtained and rapid changes in the level can be observed. 


A receiving system’ that meets these requirements consists of a traveling- 


wave maser’ and a horn-reflector antenna. 

This type of antenna was designed for microwave radio relay systems 
and is shown in Fig. 3; it consists of a section of a paraboloid fed by a 
pyramidal horn. The antenna is broadband and is well matched. How- 
ever, its virtues as a low-noise antenna stem from the fact that the far 
side and rear lobes are about 30 db below the isotropic level, thus allow- 
ing only a small amount’ of noise from the earth (which is a good ab- 
sorber at microwaves) to enter the antenna; also, only a short length of 
transmission line is required between the horn and receiver proper (0.1 
db transmission loss at room temperature produces 73° of noise). At 
6 kme, the beamwidth of the antenna shown in Fig. 3 is about 1.75 de- 
grees and its gain is 40 db. The antenna mount permits movement in the 
elevation plane only, but in this plane the antenna beam may be moved 
from horizon to horizon. To permit this movement, a rotary joint is 
used in the waveguide run between the antenna and the maser. The 
length of this waveguide run is minimized to reduce waveguide losses and 
their associated noise contributions. A means of ensuring the rapid exit 
of water from the antenna is of practical concern, since the aperture is 
open to the elements; a few holes in the seam formed by the parabolic 
section and horn satisfy this requirement. 

The essential features of the receiver proper are shown in Fig. 4 in 


* Private communication through the courtesy of J. 8S. Marshall, MeGuill 
University, Montreal, Canada. 
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Fig. : Horn-reflector antenna. 
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Fig. 4 Low-noise receiver for 6 kme 
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block diagram form. At 6 kme, the traveling-wave maser has a gain of 28 
db and a bandwidth of 15 me at a liquid helium temperature of 1.5°K. 
The magnetic field for the maser is provided by a permanent magnet. 
The pumping power is obtained from a stabilized 19.7-kme klystron with 
an output of approximately 40 mw. Isolators are used between maser 
and attenuator and between attenuator and balanced converter to 
prevent the possibility of noise power re-entering the maser and, due to 
any input mismatch, reappearing as a net increase in input temperature. 
The remainder of the system utilizes a balanced crystal converter and 
Il° amplification. 


IV. MEASUREMENT TECHNIQUES 


The over-all input noise temperature of the receiver consists of a con- 
stant term, which includes noise due to the antenna, input waveguide 
loss, maser, and the contribution from the second stage; these amount to 
15.5°K, which is the limit of the sensitivity of the receiver proper. An 
additional term is the sky noise temperature, which varies as a function 
of the microwave absorption in the volume of the sky subtended by the 
antenna beam; on a clear day the sky noise is about 3°, which increases 
the input noise to 18.5°K. 


During periods of relatively stable atmospheric conditions a compari- 


son method employing an argon noise lamp is used to measure the input 


temperature. The noise lamp is coupled into the input waveguide through 
a 30-db directional coupler. With the noise lamp switched off, the room 
temperature load which terminates the noise lamp introduces about 
0.3°IK; when the noise lamp is fired, the input noise is increased by 1O°K. 
The change in noise level is determined by using a precision microwave 
attenuator. This is set at zero with the noise lamp off, and the IF gain 
of the measuring equipment is adjusted to obtain a suitable output level 
of detected noise; then the noise lamp is fired, and the precision at- 
tenuator is adjusted to give the same noise output from the measuring 
set. 

The noise temperature at the input to the converter, 7;, , with the 
noise lamp off, is given by 


where 

G,, is the gain of the maser, 

T, is the effective antenna noise temperature (waveguide, antenna, 
and sky), and 
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T,, is the effective noise temperature of the maser. 
In this equation a small noise due to losses between maser and converter 


has been neglected. When the noise lamp is fired, the noise input to the 


converter becomes 
T (T. + Tn + TL)GnA + (1 — A)T), (3) 


where 
T, is the additional noise introduced into the input waveguide by the 
noise lamp, 
T) is ambient room temperature, and 
A is the reciprocal of the additional attenuation introduced by the 
precision attenuator. 
Let T; = T, + Tm; then, solving (2) and (3), 
AT, To 
1-—A ¥ 3 


The second term of this equation is the noise contribution of the pre- 
cision attenuator; with maser gains in the order of 30 db, it is less than 
0.5°K. 

The noise-lamp method of measurement, although quite accurate, is 
time-consuming and does not lend itself readily to continuous measure- 
ment, since atmospheric conditions can and do change rapidly. For a 
continuous record of the changes in sky noise, the detected output. of the 
intermediate frequency amplifier is applied to a two-channel Sanborn 
recorder. On one channel the full-scale sensitivity is of the order of 
250° K, while on the second channel it is +6°K. It should be noted that a 
15-me band is preserved up to the recorder and that the recorder has a 
maximum bandwidth of 80 cps. Simultaneously, the output is displayed 
on an oscilloscope. 

Ground-rain rate data obtained from a calibrated tipping-bucket rain 
gauge located about 50 feet from the antenna are fed to a timing channel 
on the recorder; each tip of the rain gauge bucket indicates that 0.01 
inch of rain has fallen. This type of gauge is fairly accurate, provided the 
wind is not excessive. The time interval between two consecutive bucket 
tips is carefully measured, and the rainfall rate is then plotted at the 
mean value of time of two consecutive tips. This procedure was used for 
all recorded bucket tips. 


V. DISCUSSION OF MEASURED DATA 
5.1. The Water Vapor Effect 


Fig. 5 shows a set of data taken on October 7, 1960, with sky tempera- 
ture plotted as ordinate and antenna beam position as abscissa. This 
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Fig. 5 — Sky noise temperature for a dry clear day. 


particular day was dry and clear; based on temperature and humidity 
measurements made at the receiving site, the absolute water vapor 
density was 5 grams per cubic meter at the ground. The curve labeled 
“effective input temperature” shows the values of the first-cireuit input 
noise temperature for various antenna positions; these values are the 
sum of the sky temperature plus the noise temperature contributed by 
input waveguide losses, maser noise, and antenna. The actual sky 
temperatures for the various antenna beam positions are plotted as dots. 
These data were obtained by averaging groups of measurements taken 
for each antenna beam position, as the antenna was moved from zenith 
to horizon in one-degree increments. For comparison, the dashed curve 
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represents the theoretical sky temperature based on a standard earth’s 
atmosphere, as obtained from the central curves of Fig. 2. 

rom the relative smoothness of the dotted curve in Fig. 5 and the 
absence of abrupt changes, one concludes that the back and side lobes of 
the antenna allow only a small amount of noise radiation from the earth 
into the system. There is little difference in sky noise between that meas- 
ured for a water vapor density of 5 grams per cubic meter and the theo- 
retical values based on a water vapor density of 10 grams per cubic 
meter. A period of two and one-half hours was required to complete this 
set of measurements. During this time atmospheric conditions could have 
changed somewhat, even though no change was visibly detectible. 

In order to determine the effects of water vapor density, antenna 
temperatures were measured for various sky conditions at random times 
during the year 1959-60. Fig. 6 shows the increase in sky temperature 
above zenith temperature for various water vapor densities. Again, the 
values for water vapor density were based on measurements made at 
the receiving site. Measurements with the antenna beamed 60°, 85°, and 
874° from the zenith are shown. As the antenna beam approaches the 
horizon, it looks through greater amounts of the earth’s atmosphere, so 
that the water vapor effect is accentuated. The data show more or less 
what we might expect: the noise temperatures increase with increasing 
water vapor density. The data shown here are conservative because they 
are taken relative to the zenith temperature for the day that the measure- 
ments were made; in other words, if this were not done, the slope of the 
curves would be slightly greater. With the antenna beam at 60° from 
the zenith, the sky noise temperature increases about 1° from winter to 
summer; at 85° from zenith, it increases 15° from winter to summer. 
These increases are partly due to changes in the effective radius of the 
earth, since the refractive index profile also changes with water vapor 
content. The scatter in the data is partly due to the following: water 
vapor measurements are only made at the receiving site; the distribution 
of water vapor with height is unknown, and hence the changes in the 
effective earth’s radius are also unknown. 


The dashed curves in Fig. 6 connect three theoretical noise tempera- 


tures for water vapor contents 0, 10, and 20 grams per cubic meter, as 
obtained from Fig. 2. Especially in the case of the 85° curve, agreement 
with the experimental data is considered to be good. 


5.2 The Rain Effect 


Sky temperature measurements taken on a drizzly day (May 18, 1960) 
for various zenith angles of the antenna beam are shown in Fig. 7; the 
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Fig. 6 — The effect of water vapor on sky noise temperature 


ground rain rate of about 6 millimeters per hour is also indicated on the 
figure.* The increase in sky temperature due to the light rain is readily 
apparent. The curve titled “18 May 1960 2:45 p.m.” in this group is 
interesting, for here, at beam position of 85° and greater, the noise 
temperature is essentially the same as it is for the dry day. Hence it 
appears as though the rain was fairly local at the time the measure- 
ments were made. All data discussed so far were obtained using the RI 
attenuator method. 

* It should be emphasized that the indicated rainfall rates are measured at the 
antenna site. When the beam approaches the horizon, however, it looks through 


several hundred miles of atmosphere, which may contain rain of various densities 
In this case, the measured rain rate at the receiving site is of little significance 
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Fig. 7 Sky noise temperatures for dry and wet days. 


Sky temperature data for rapidly changing sky conditions which occur 
during periods of rainfall, obtained by means of the recorder, are shown 
in Figs. 8 through 12. These were selected as representative of the changes 
which usually occur in zenith sky temperature just prior to and on 
arrival of the rain at the ground. The left ordinate is zenith sky tem- 
perature, the right ordinate ground rain rate, with time on the abscissa. 
Before the arrival of the rain and after its departure from the location, 
the sky temperature may undergo variations on the order of several 


degrees. One should note on the measured data that an abrupt increase 


in sky noise is observed well before any significant rain is measured at 
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Fig. 8 Sky noise temperatures observed during a rain storm (June 24, 1960) ; 
note that the sky temperature increases before the rain rate 


the ground; hence the sky noise is a precursor of rain at the ground. 
Most probably, the precursory action is due to a slope in the rain column 
as the water falls from a fairly localized cloud which is moving with the 
upper wind.* Thus the antenna beam sees a maximum water density 
before it is observed at the ground. 

Of special interest are the data shown in the last portion of Fig. 9 
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Fig. 9 Relatively high sky noise with little or no measurable rainfall (June 
24, 1960). 


* See Fig. 9 of Ref. 7. 
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Note that, although the sky temperature increased more than 40°K, no 
rain was recorded. It is apparent here that the antenna beam looked 
through a cloud containing a large amount of condensed water. Even 
though no rainfall was recorded at the receiving site, the water from this 
cloud was no doubt released further along the path of the storm. The 
data in Fig. 9 are in direct contrast with those in the last portion of Fig. 
8, for in the latter case a rain rate in excess of 150 millimeters per hour 
produced only moderate sky temperatures. A good example of the rapid 
and large changes that can occur in sky temperature is shown in Fig. 10: 
in a period of about 40 seconds it increased 66° K. The abrupt break in the 
sky temperature curve was due to a 20-second power interruption caused 
by lightning; the remainder of the interval was used in re-checking the 
calibration of the measuring equipment. 
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Data obtained during a severe thunderstorm (June 





EFFECT OF RAIN AND WATER VAPOR ON SKY NOISE 1345 


There appears to be no detailed correlation between measured ground 
rain rate and zenith sky temperature; for example, in Fig. 11, fairly high 
sky temperatures were recorded when ground rain rates were low. This 
lack of correlation is further indicated in Fig. 12(a), for during the 
period shown the noise is fairly high but no measurable amount of rain 
fell. Fig. 12(b), however, shows that small variations in zenith tempera- 
ture can occur with correspondingly low rain rates. Since data of this 


type are a measure of the water content of the atmosphere, they are of 


some meteorological significance and perhaps may serve as adjuncts to 
measurements by weather radar. 

Since rain noise in space receivers plays the counterpart of signal fad- 
ing in line-of-sight communications, a percentage-time distribution of 
sky noise is required for making reliability estimates. Although a rela- 
tively small statistical sample has been obtained so far, Fig. 13 shows 
both distributions of zenith sky noise temperature and ground rain 
rates. Additional measurements at the zenith and especially at beam 
positions near the horizon, where the background noise is higher, are 
needed to improve the statistics and to indicate the limit for a communi- 
cation system. The median value of zenith sky temperature is 14.7°K or 
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Fig. 11 Iixample of lack of correlation between measured ground rain rate 
and zenith sky noise temperature (data of June 18, 1960). Compare these data 
with those in Fig. 8 
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Fig. 12 a) Example of lack of correlation bet ween measured ground rain rate 


and sky temperature (June 3, 1960); (b) small variations in sky noise temperature 
July 27, 1960 


7.2 db above 2.8°K, the zenith temperature for an atmosphere containing 
a water vapor density of 10 grams per cubic meter, the corresponding 
median rain rate being 5 millimeters per hour. The noise temperature 
follows a log-normal distribution with a standard deviation of approxi- 
mately 4 db. It should be cautioned that the relationships between 
the noise and rain rate distributions do not necessarily apply for any 
other general location, even though the ground rain rate distribution at 
that location is known. 

It was mentioned in Section II that, if the distribution of rain along 
the antenna beam were known, one could calculate its noise contribu- 
tion, since the absorption coefficient, a, is a known function of wave- 
length and rain density. If the assumption is made that the raindrops 
are uniformly distributed over a given interval along the beam, then the 
absorption a(r) is a constant over the interval. Calculations based upon 
(1), measured 6-kme sky noise and measured ground rain rate, then can 
be made for the noise temperature which should obtain at other fre- 
quencies in the microwave band. For example, using this procedure,* a 

* It should be noted that scattering of noise radiated from the earth into the 
antenna by the raindrops contributes to the total sky noise. However, calculations 
show that the scattering contribution is small. For example, again using a rainfall 
rate of 47.0 millimeters per hour and a measured 6 kme temperature of 110°K, the 


contribution by scattering in the far field of the antenna is 3.3°K, the near field 
contribution 0.6° K, and the total scattered contribution 3.9°K, which is only 


) 


3.5 per cent of the total measured value 
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Fig. 13 — Percentage time distributions of zenith sky noise and ground rain 
rates, for eight rain periods from March 3, 1960 to July 27, 1960. 


rainfall rate of 47.0 millimeters per hour that produced a measured tem- 
perature of 110°K at 6.0 kme would produce a corresponding tempera- 
ture of 34°K at 4.0 kme, a decrease of 5.0 db. 

Snow is another form of condensed water prevalent in the atmosphere. 
Noise measurements were made during a snowfall on January 27, 1961, 
but these show only a slight increase over those obtained on a clear day. 
Since snow is a fairly low-loss dielectric at centimeter wavelengths, this 
result is not unexpected. 
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VI. CONCLUSIONS 


Based upon measurements of sky noise temperatures made in atmos- 
pheres devoid of rain at a frequency of 6 kme, the following seasonal 


changes in sky temperature can be expected: at an antenna beam posi- 


tion 60° from the zenith, approximately 13°K; at 85°, about 15°K. If 
the assumption is made that the desirable low-noise frequency band 
extends from 2 to 8 kme, again neglecting the effects of rain, theoretical 
calculations establish the following limits: with an antenna beamed 60° 
from the zenith and for a frequency of 2 kme, the seasonal change in 
sky temperature is 0.6°K; for a frequency of 8 kme, approximately 5°K. 
With an antenna beam 85° from the zenith at a frequency of 2 kme, 
this change is 1.0°KK and, at 8 kme, it is 26°K. Data obtained at 6 kme, 
with the antenna beam in the favorable zenith position, indicate that 
rain can increase the sky noise temperature as much as 17 db. In a 
space communication system, these changes in noise level are expected 
to play the: counterpart of signal fading in line-of-sight communications; 
therefore suitable margins must be allowed for them. 
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The Calculation of Metallic Hall 


Constants: Topological Aspects 


and Appl ‘ations to Copper 


By JOHN R. KLAUDER 


(Manuscript received March 30, 1961) 


The theoretical prediction of the Hall constant in metals appropriate to 
high magnetic fields depends, in part, on a repeated computation of a 
specific topological invariant: Each of several parallel two-dimensional 
planes in crystal-momentum space must be analyzed to find, on a single 
sheet of the Fermi surface, the number of nonintersecting closed contours 
that surround an arbitrary point. This number is related to the effective 
sign of the charge of the “relevant” electronic carriers in the plane car- 
riers whose contribution to the Hall constant ts based on their orbital area. 
Applications of the technique discussed in this paper have been made to 
the Pippard model for copper, and various carrier contours are illustrated 
that were found in the course of three calculations of the Hall constant in 
cases where the magnetic field direction lay along a principal crystallo- 
graphic axis. 


I. INTRODUCTION 


When a current-carrying metallic sample is placed in a uniform mag- 
netic field, a change in the resistance is generally observed. This phe- 
nomenon, known as magnetoresistance, is the basis for an important 
experimental technique for studying the band structure of metals. If a 
simple model is taken to represent a metal (i.e., an effective mass ap- 
proximation, a constant relaxation time, ete.), then the expected change 
in resistance can be theoretically predicted. Even with a more realistic 
model, it is found that one experimentally observable feature can still 
be predicted, namely, the Hall constant. The analytical expression for 
the Hall constant [presented in (4)| is an integral over a family of 
parallel two-dimensional planes. For each plane the integrand is pro- 
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Fig. 1 Several Fermi energy contours, shown in heavy lines, generated by 
the intersection of the plane P, = constant with a hypothetical constant energy 
surface 


portional to the area of that part of the plane surrounded by an odd 
number of zero-contours of a function defined in that plane. 

Prediction of this resistance parameter turns out in general to be a 
somewhat complicated analytical test, and this paper discusses some 
interesting aspects of a numerical method developed by the author to 


calculate the Hall constant. In part, the calculation procedure repeatedly 


requires a topological investigation to be carried out on a two-dimensional 
plane in order to evaluate a particular topological invariant associated 
with that plane. 

Stripped of the particular physical problem which motivates the 
study, the essence of the topological inquiry is the following: Suppose 
there is a container with many pieces of string of varying lengths. The 
ends of each piece of string are tied together so that each piece forms a 
closed loop. As many loops of string as desired are placed on a table, 
subject only to the requirement that they do not touch one another; 
it should be emphasized that loops within loops are permitted (see Fig. 
| for a suitable example). Now, select a point on the table not on a 
string itself, and ask for the number of loops which surround that point 
(thus, for example, point | in Fig. 1 is surrounded by one loop, loop a). 
This is clearly a topological question, since the number of loops which 
surround a point is unaffected by an arbitrary displacement of each 
loop — a displacement which in addition does not permit any loop to 
pass through the point of investigation. 





METALLIC HALL CONSTANTS 


A simple pictorial model should help to clarify the role of topology in 
the calculation of the high-field Hall constant. Fig. 2 shows a hypotheti- 
cal energy surface with cubic symmetry, composed of “spheres”? and 
“eylinders.”” The intersection of this surface with each of a family of 
parallel planes defines the set of contours that are pertinent for the 
Hall constant calculation. For definiteness, consider those planes, two 
of which are illustrated in Fig. 2, perpendicular to the [OO1| axis. If 
the plane, like plane a in Fig. 2, cuts across the cylinders, then it is the 
area of the various cylindrical cross sections that contributes to the 
Hall constant calculation. However, for plane b, the second plane shown 
in Fig. 2, the contribution to the Hall constant is based on the planar 
area outside the spheres and cylinders, since now the regions in that 


plane surrounded by an odd number of contours lie outside the cylinders. 


Other planes are analyzed in a similar fashion. 




















Fig. 2 A portion of an ideal Fermi surface model of cubic symmetry com 
posed of spheres of radius A and cylinders of radius D. The cube edge is of length 
B, and represents also the length of the cubic Brillouin zone, which is centered 
about each sphere. Two (O01) planes, a and 6, illustrate several different planar 
orbits 
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Fig. 1 Several Fermi energy contours, shown in heavy lines, generated by 
the intersection of the plane P, = constant with a hypothetical constant energy 
surface. 


portional to the area of that part of the plane surrounded by an odd 
number of zero-contours of a function defined in that plane. 

Prediction of this resistance parameter turns out in general to be a 
somewhat complicated analytical test, and this paper discusses some 
interesting aspects of a numerical method developed by the author to 
calculate the Hall constant. In part, the calculation procedure repeatedly 
requires a topological investigation to be carried out on a two-dimensional 
plane in order to evaluate a particular topological invariant associated 
with that plane. 

Stripped of the particular physical problem which motivates the 
study, the essence of the topological inquiry is the following: Suppose 
there is a container with many pieces of string of varying lengths. The 
ends of each piece of string are tied together so that each piece forms a 
closed loop. As many loops of string as desired are placed on a table, 
subject only to the requirement that they do not touch one another; 
it should be emphasized that loops within loops are permitted (see Fig. 
| for a suitable example). Now, select a point on the table not on a 
string itself, and ask for the number of loops which surround that point 


(thus, for example, point 1 in Fig. 1 is surrounded by one loop, loop a). 


This is clearly a topological question, since the number of loops which 
surround a point is unaffected by an arbitrary displacement of each 
loop — a displacement which in addition does not permit any loop to 
pass through the point of investigation. 
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A simple pictorial model should help to clarify the role of topology in 
the calculation of the high-field Hall constant. Fig. 2 shows a hypotheti- 
cal energy surface with cubic symmetry, composed of ‘spheres’? and 
“eylinders.”’ The intersection of this surface with each of a family of 
parallel planes defines the set of contours that are pertinent for the 
Hall constant calculation. For definiteness, consider those planes, two 
of which are illustrated in lig. 2, perpendicular to the [001] axis. If 
the plane, like plane a in Fig. 2, cuts across the cylinders, then it is the 
area of the various cylindrical cross sections that contributes to the 
Hall constant calculation. However, for plane b, the second plane shown 
in Fig. 2, the contribution to the Hall constant is based on the planar 
area outside the spheres and cylinders, since now the regions in that 
plane surrounded by an odd number of contours lie outside the cylinders. 
Other planes are analyzed in a similar fashion. 
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Fig. 2-—- A portion of an ideal Fermi surface model of cubic symmetry com 
posed of spheres of radius A and cylinders of radius D. The cube edge is of length 
B, and represents also the length of the cubic Brillouin zone, which is centered 
about each sphere. Two (001) planes, a and 6, illustrate several different planar 
orbits. 
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In Section II, we define the customary resistance parameters in terms 
of the general resistance tensor that determines the electric field in 
terms of an applied current. The analytical expression for the high-field 
Hall constant is related to the direction of the magnetic field, the Fermi 
surface of the metal, and the symmetry of the erystal. In Section III 
those aspects of our calculation procedure that study the topology of 
nonintersecting planar contours are discussed in detail, and a logical 
procedure is developed, suitable for study with the aid of a digital 
computer, by which the “relevant” area may be ascertained in the most 
general case. In addition, a brief discussion is included of the significance 
of open contours, i.e., those contours that possess a net direction and 
never close upon themselves. 

Section IV presents some of the numerical results obtained in the 
calculation of the Hall constant for copper. These results are based on 
the phenomenological model for copper that was proposed by Pip- 
pard.’ Several figures are shown that depict orbits, both closed and 
open, computed for various charge carriers in planes transverse to the 
three axes [001], [O11], and [111]. 


II. BACKGROUND AND ORIGIN OF THE PHYSICAL PROBLEM 


In the most general situation, the relation between an applied cur- 
rent density (j,) and the induced electric field (#;) is deseribed by a 
resistivity tensor (p,;) according to the equation 


E, Preiji ° (1) 


Here, and in what follows, all indices run from 1 to 3, and the summa- 
tion convention is adopted. In the presence of a magnetic field (//,), 
the tensor py, becomes a function of H,, restricted by the Onsager 
relations so that px:( —H,, ) pu(H,,). Two invariant expressions are 
commonly taken as descriptors of the resistivity tensor. The first is the 
resistivity p, defined by 


p PirINKN7 , (2) 


a double summation involving the unit vector n; which is in the direc- 
tion of the current 7;. The second invariant parameter is the Hall con- 
stant Ry, , defined by 


| PKl€kin H.. } 
H? 


Rw (3) 
where, as usual, H HH, and ei» is the three-index, completely 
antisymmetric tensor defined as +1 (or —1) whenever (/lm) is an even 


) 


(or odd) permutation of (123). Observe that the resistivity, (2), is 
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sensitive only to the symmetric part of the resistance tensor, while the 
Hall constant, (3), depends on the antisymmetric part of p,;: . 

The theory developed by Lifshitz, Azbel, and Kaganov’ and by Lif- 
shitz and Peschanskii’ treats conduction in a magnetic field in a very 


general way. The underlying picture of their theory is most easily 
visualized in crystal-momentum space, the dual space of the ordinary 
three-dimensional space in which the metallic lattice lies. At very low 
temperatures the motion of the charge carriers in a metal is confined 
to a unique energy surface, the Fermi surface; the carrier motion is 
also confined to lie in one of the planes in momentum space perpendicular 
to the direction of the magnetic field H;. The periodicity of the lattice 
in physical space leads to a periodicity of the reciprocal lattice in mo- 
mentum space, and the periodicity of the reciprocal lattice is imposed 
on the energy surfaces imbedded in momentum space. 

If a closed segment of the Fermi surface is contained within a Bril- 
louin zone, the fundamental zone of periodicity which we identify with 
the usual or “proximity”? zone, then the orbits on which the charge 
carriers move are all closed contours, when studied for any direction of 
the magnetic field. If the region immediately within an orbit contains 
energies less than the Fermi energy /; (a value which defines the Fermi 
surface), then that orbit is said to be “‘electron-like.’’ Conversely, if the 
immediately interior region has energies greater than /; , then the orbit 
is ‘‘hole-like.”’ 

When the Fermi surface encounters the boundary of the Brillouin 
zone, then the periodicity requirement can lead to a connected surface 
extending throughout the entire momentum space. For instance, the 
simple cubic model in Fig. 2 represents a connected Fermi surface com- 
posed of “spheres,” each in a separate Brillouin zone, that are connected 
by “eylinders.”” On such an open surface, some orbits — open orbits 
can have an infinite extension, never becoming closed. In particular, 
planar open orbits directed along the cylinders exist for the model il- 
lustrated in Fig. 2. Nevertheless, even for open Fermi surfaces, certain 
directions of the magnetic field (whose direction dictates the family of 
parallel planes on which the orbits lie) lead to all closed orbits; i.e., 
ach orbit in every plane is closed, at least in some finite number of 
Brillouin zones. The fundamental property of all closed orbits charac- 
terizes the class of magnetic field directions for which the Hall constant 
Ry, can be theoretically predicted. Under these circumstances, the 
analytical expression,” 


h i 
“ 2ec (| ? f p. tp») 


1 
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defines Ry, , where h is Planck’s constant, e > 0 is the value of the elec- 
tronic charge, and ¢ is the speed of light. The integration over p, repre- 
sents a summation over the various momentum planes perpendicular 
to the magnetic field; now, and hereafter, the z direction is taken along 
the direction of the magnetic field. The domain of integration in (4) 
can be restricted to a single Brillouin zone, as will be discussed in Section 
3.1.1. The contour integral extends around closed orbits in each p,p, 
plane. If an orbit is electron-like, the contour integral in (4) is a positive 
number proportional to the area of the interior domain. On the other 
hand, if an orbit is hole-like, the contour integral in (4) is a negative 
number proportional to the area of the interior domain. 

After suitable scale factors are introduced it suffices to study a 
completely geometrical factor in the Hall constant, which we call G: 
Let p; — P;, where the P; represent appropriate dimensionless vari- 
ables; then 


G = | aP. p PedP,. (5) 
This integral is based only on (a) the direction of the magnetic field, 
which selects the z direction, (b) the several plane surfaces, fixed in 
crystal-momentum space, which define the Brillouin zone boundary, 
and (c) the Fermi surface; also fixed in crystal-momentum space, on 
which the orbits are constrained to lie. In many substances the Fermi 
surface is multisheeted in each Brillouin zone. In this case a “G’’ may 
be determined for each sheet independently according to (5); the true 
“@” is the sum of the various individual ones. For simplicity, we confine 
our further analysis to a Fermi surface with only one sheet. 


Ill, STUDY OF THE GEOMETRICAL FACTOR IN THE HALL CONSTANT 
3.1 Computational Procedure 


3.1.1 Domain of Integration and Subdivision of the Calculation 


The formal definition previously given” for (4) [or (5)] states that 


the integration domain covers all of momentum space (infinite in ex- 
tent), but that the redundancy of the total number of Brillouin zones 
(also infinite) is to be divided out. The periodicity of the energy surface 
suggests that the numerical result obtained from an integration over 
this weighted infinite domain should be equal to the result of a mo- 
mentum integration restricted to only one Brillouin zone. Indeed, if all 
the closed orbits lay within a single zone (as on a small sphere), then 
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the integration domain could readily be restricted to only one Brillouin 
zone. 

However, when the Fermi surface is open, it is a common occurrence 
that all of the different closed orbits do not lie within a single zone. For 
example, for certain directions of the magnetic field, some of the larger 
orbits may close only after excursions through a number of different 
zones. It is the possibility that there may be some large orbits, which 


necessitates the seemingly redundant integration over a correspondingly 


larger domain of momentum space, so that within each P.-plane, the 
true nature of the larger orbits, i.e., whether they are electron- or 
hole-like, may be ascertained directly. 

If an independent determination of the nature of all the orbits 


se 


say, a “list”? of their electron- or hole-like character — were already in 
hand, then G could be calculated by an integral over only one Brillouin 
zone. This limited integration would be valid because the “list’’ would 
already specify the true nature of any orbit for which only a portion of 
that orbit lay on a single plane within one Brillouin zone. It is this gen- 
eral scheme that we employ to limit the domain of integration to the 
customary first Brillouin zone centered about the point P, = P, 

P, = 0. 

Must we examine every orbit in an arbitrary P,-plane for our ‘“‘list’’? 
No, fortunately, as the following argument shows. In a fixed momentum 
plane, simultaneous electron- and hole-like orbits can exist on a single 
sheet of the Fermi surface only if one orbit type encloses another. 
Several different orbits are pictorially indicated in Fig. 1, separated by 
clear and shaded regions which represent, for example, energies above 
and below the Fermi energy. For the particular P,-plane shown in Fig. 
1, the geometrical factor G would acquire contributions from the entire 
area within the orbit marked a and also, with the opposite sign, the 
entire area interior to the orbit marked b. The net result consists in 
including, with a single sign, only the shaded area between the orbits 
a and b. This same line of reasoning applies to the remainder of the 
orbits in this plane, so that just the shaded area within the boundaries 
of the first Brillouin zone contributes to G. As a result only the ‘shaded 


‘ 


area,” or, as we shall call it hereafter, the “relevant area,” corresponding 
to a unique sign of the energy relative to the Fermi energy, enters into 
the calculation of the Hall constant for each plane P, = constant. Thus, 
to obtain the equivalent of a “list”? of the orbits suitable for calculating 
G, it suffices to study just one point. If this point is surrounded by an 
odd (even) number of contours, then it is within (outside) the relevant 
area; all points whose energy F relative to the Fermi energy EF; is of 
the same (opposite) sign lie in the area relevant in calculating G. 
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3.1.2 Certain Orbit Characteristics Determined by a Topological Study 


The relevant area in Fig. | is independent of whether the interior 
points have energies greater or less than the Fermi energy. Indeed, Fig. 
| graphically illustrates the following property: Every point in the 
re levant area is enclosed by an odd numbe r of closed orbits. lor example, 
point | is within a shaded region of Fig. 1 and is enclosed by one loop, 
and so on for points 2 and 3. Furthermore, this characterization of the 
points in the relevant area is invariant under any deformation of the 


contours such that no contour passes through the point under study. 


This invariance property is topological in nature, being independent of 
<pecific details of the orbit shapes. 

In order to accurately determine the number of contours surrounding 
a given point, a portion of the plane P constant must be “scanned”’ 

a portion generally much larger than that lying in the first Brillouin 
zone. Orbits which surround a point can only be found if they are ac- 
tually encountered in the investigation; however, the restriction to a 
finite region of investigation is not serious in practice. 

In the selection of a | gical procedure to find the number of contours 
enclosing a particular point, simplicity is certainly a desirable feature. 
One very simple means of counting the contours (modulo 2, which is all 
that is necessary here) would seem to be the following: Pick a point 
not on a contour (i.e., & + E,) and, starting there, investigate the 
energy values for a large sequence of closely spaced points all lying in a 
straight line. If an odd (even) number of crossings of the Fermi energy 
is found, the original point lies inside (outside) the relevant area. This 
intuitive procedure is, unfortunately, not always correct. Fig. 3 illus- 
trates a set of contours in a fixed plane and shows three possible paths 
that could be used to determine the topology using the preceding scheme. 
Joth paths 1 and 2 encounter three contours, and both emanate from 
shaded regions, i.e., relevant regions, as would be predicted by the 
above rule. However, path 3 illustrates the difficulty with this tech- 
nique. In this example, the path ends inside a “‘sand-trap,’’ and only 
two contours are encountered. Thus, we would erroneously conclude 
that the initial point did not lie within the relevant area. Clearly, a 
more sophisticated procedure of investigation is required that avoids 
the possibility of ending in a “‘sand-trap”’ by skirting around it. 

Several alternate schemes of investigation come to mind which have 
the desired property of skirting around ‘‘sand-traps.”’ In each of these 
an original test point is selected, and a series of points lying in some 
preselected straight line are tested for the crossing of a Fermi surface 


contour. If a contour is encountered, the contour is then followed, by 
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Fig. 3 — Fermi energy contours in a plane of fixed P. . Three different paths 
directed as shown, are tested in an attempt to determine where the shaded o1 
“relevant’’ area lies. 


always turning (with no loss of generality) initially to the left. The 


schemes differ in how far the contour is followed before the path leaves 
it. 
One suggestive scheme has the path leave the contour as soon as it 


can proceed, without crossing the contour itself, in a radial direction 
whose origin is the original test point. A simple study reveals several 
fallacies in this scheme. Another scheme, of slightly greater complexity, 
has the path follow the contour until an extension of the original straight 
line of investigation is first encountered. At this point the path leaves 
the contour and continues in the original direction searching for a new 
contour. This scheme is likewise not foolproof in general, and must be 
abandoned. Among the contours it fails to analyze correctly is one 
shaped like the letter G if the test point is below the ‘hook’? and the 
direction to be followed is upwards. Of course, in the general case, the 
location of the original point and the direction of inquiry are quite 
arbitrary, relative to the position and orientation of the contours to be 
studied. 

The method of investigation illustrated in Fig. 4, although still more 
complicated, leads to an accurate count of the orbits surrounding any 
point. When a contour is encountered in the present scheme, that con- 
tour is followed completely until the point is reached at which the contour 
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Fig. 4 A satisfactory means of investigating an arbitrary layout of contours 
to establish the number of orbits which surround any point. It is applied here to 
a plane of fixed P, that contains two distinct contours 


was originally engaged. While the entire contour is being examined, 
the number of points on the contour are counted that lie along a line 
which is an extension of the original line of investigation. Whether this 
number is odd or even determines whether that contour does or does 
not surround the original test point.* After the contour is completed, 
the path jumps to the furthest of these points, and then proceeds radially 
again along the original direction searching for a new contour to study. 
If a new contour is found, it is studied in completely the same fashion. 
Fig. 4 schematically illustrates the steps in a complete study of one 
contour by the present technique: the quest for the contour, the entire 
tracing of the contour, and the subsequent leaving in the original di- 
rection to continue the topological investigation. This investigation 
procedure is the one employed in the topological study of the various 
P.-planes in calculating the geometrical part of the Hall constant by (5). 
A straightforward numerical integration, not discussed here, was used 
to determine the appropriate area in each plane that contributes to G. 

* The more lengthy procedure ot summing the polar angle variations along the 
contour can and has been used. This method can provide an exact count of the 


number of contours surrounding the original point, rather than an exact count 
modulo 2 provided by the method discussed in the text 
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3.2 Additional Features of a Calculation Encountered in Practice 


It was pointed out in Section Il that the Hall constant cannot be 
theoretically predicted on the basis of (5) for magnetie fields which lead 
to a sizable, or nonnegligible, number of open orbits. However, this 
does not mean that open orbits will not be encountered in’ practical 
calculations of the Hall constant. On the contrary, open orbits may be 
present for one of several reasons. For example, all the magnetic field 
directions which support open orbits may not be known a priori for a 
phenomenological Fermi surface; only by trial and error can all such 
directions be discovered. 

Another reason why open orbits may be encountered is based on the 
fact that numerical computations on digital machines can never be 
made with absolute precision. In particular, in the technique employed, 
a path of investigation is said to follow a contour so long as the value 
of the energy F satisfies the relation | — E,;| < ¢, where ¢ is a small, 


47 . . . > 
nonzero number (e + 10 “F;), fixed by the gradient of the energy fune- 


tion and by practical computation requirements. It is entirely possible 
that two closed orbits which come close to each other cannot (computa- 
tionally) be distinguished from a single connected orbit if the energy 
values satisfy | kh — E,| < ¢€ along some path that connects the con- 
tours. If a sequence of closely spaced closed orbits exists, an error may 
be repeatedly or even unpredictably made, so that the path of investiga- 
tion does not close. While the likelihood of errors being made decreases 
as € is made smaller, a proviso must be made for encountering either one 
of two types of “open”? contours: (a) valid open orbits for which the 
path proceeds through momentum space along an open contour possess- 
ing a net directivity, or (b) “lost”? open orbits, which may or may not 
indicate the presence of true open contours, for which the path of in- 
vestigation gets lost and wanders rather aimlessly throughout momen- 
tum space. The first type of open orbit, i.e., a valid open orbit, is de- 
tected to a high accuracy by placing an upper bound on the radial 
distance from the test point that the path travels in momentum space; 
the second type, i.e., a “lost”? open orbit, is sensed by an upper bound 
on the number of individual points of investigation that can be made 
along any single contour. Whether an open orbit declared ‘‘valid’’ is 
truly open or just part of an exceedingly extended closed orbit is not 
too important, since the magnetic field is, after all, finite, and greatly 
extended closed orbits become indistinguishable from open orbits when- 
ever the orbit circumference ¢ divided by the mean free scattering length 
{is comparable with the magnetic field strength, suitably normalized. 
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Such considerations can in fact act as a guide in selecting the upper 
bounds placed on the path’s excursion. 

The presence of open contours generated by “tunnelling through an 
energy barrier” (i.e., along a path such that | k — b;|) < e) is not 
necessarily an academic case when ¢ is small. There is a small but very 
important class of magnetic field directions that lead to both closed and 
open orbits, but to the latter only on a set of P.-planes of measure zero. 
The hypothetical connected Fermi surface illustrated in Fig. 2 is one 
for which sets of open orbits of measure zero exist when the magnetic 
field is along the principal crystalline axis [001], or equivalently along 
either the [010] or [100] directions. 

ven in the presence of an infinitesimal number of open orbits, the 
geometrical part of the Hall constant is still theoretically predicted ac- 
cording to (5), since the contribution from those few P,-planes is negli- 
gible. In fact, those magnetic field directions that lead to a negligible 
number of open orbits are often among the most interesting, for they are 
axes that possess high symmetry. When a Hall calculation is carried out 
for such a direction and one of the few open orbits is encountered (in 
one of the two ways discussed above ), that entire P?,-plane is disregarded ; 
no investigation of the size of the relevant area is or even can be made. 
Since the open orbits are ideally of zero measure, they should be en- 
countered in the computation on at most a few of the large number of 
parallel planes studied that are perpendicular to the magnetic field. An 
open orbit of this kind, ideally belonging to a set of measure zero, will 
be illustrated in the next section. 

Finally, there is some additional information that can be found from 
a knowledge of the extent and of the directions of the open orbits them- 
selves. This additional information pertains to the expected behavior of 
the other magnetoresistance parameter, the resistivity p defined in (2). 


Generally speaking, it follows from theory” that, on the one hand, the 


resistivity saturates in a high magnetic field for those magnetic field 
directions which lead to all closed orbits, and, on the other hand, the 
resistivity continues to increase as H” for those magnetic field directions 
which possess a nontrivial set of open orbits, all of which lead off in the 
same net direction. Thus, to predict qualitatively the resistivity behavior 
for various magnetic field directions, it suffices to study the geometrical 
shape of the Fermi surface in order to see for which directions of the 
magnetic field open orbits are permitted. 

Conversely, the analysis of high-field resistivity data gained from 
measurements on high-purity metal single crystals is one of the best 


means available to gain information regarding the topology of the Fermi 
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surface, and even to provide some quantitative limitations on the shape 
of the Fermi surface. Therefore, a valuable auxiliary feature of a general 
procedure to calculate the Hall constant is the ability to shed some light 
on the expected resistivity behavior of a particular Fermi surface model 
by displaying the open orbits that are encountered for an arbitrary 
magnetic field direction. Preliminary discussions of the interpretation 
of the observed resistivity behavior for copper have been given else- 


6,7 


4.5, . . . . . ° 
where, and an extensive paper is in preparation in collaboration 


with J. E. Kunzler that will discuss magnetoresistance measurements 
and their explanation in some detail. 
IV. APPLICATION TO METALS: COPPER 


In this section we present some of the results of Hall constant analyses 


for a particular model of the Fermi surface of copper. Copper, or any 


of the noble metals, is particularly attractive for a discussion of the 
present type, since it involves a Fermi surface with only one “sheet”; 
the essential ingredients needed to discuss more complex metals are 
present without any unnecessary complications. 

Copper has a face-centered cubic space lattice whose cube edge is 
3.61 angstroms long. It follows that the reciprocal lattice in erystal- 
momentum space is body-centered cubic. The Brillouin zone that is 
characteristic of this lattice is an octahedron, truncated by a cube at the 
six vertices. If we make use of the cubic symmetry, then two inequiva- 
lent. planes suffice to define the Brillouin zone boundary: one face per- 
pendicular to the [111] axis — this face forms one of the eight surfaces 
of the octahedron; and three equivalent faces perpendicular respectively 
to the [100], [010], and [001] axes — these faces truncate the octahedron. 

Because copper has only one valence electron, the region within the 
first zone that is also contained within the Fermi surface must equal one- 
half the total volume of the first Brillouin zone. Subject to this volume 
requirement, the confining surface may, in principle, be arbitrarily de- 
formed. Suppose, as an extreme, the lermi surface assumes the free- 
electron spherical shape. It is readily deduced from the volume con- 
straint that this sphere does not intersect the boundary of the Brillouin 
zone, although it does come close to the octahedral faces. In this case 
the Fermi surface would consist of a number of disconnected spheres, 
each lying completely within a separate zone of periodicity. Only elee- 
tron-like orbits would exist: no hole-like or open orbits would be pres- 
ent. Consequently, the Hall constant would depend on the sum of all 
the electron-like areas, i.e., it would depend simply on the volume within 
the Fermi surface. The observed Hall constant would be the same for 
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any orientation of the magnetic field, and by (5), @ = Gire = 4, ex- 
pressed in units in which the volume of the Brillouin zone is unity. Such 
a state of affairs would be present if the conduction electrons were free 
and felt no perturbing influence from the lattice. 

In real copper the electrons are, of course, not free, and the Fermi 
energy surface deviates from a simple sphere. Pippard’ in 1957 was the 
first investigator to successfully derive a phenomenological, nonspherical 
Fermi surface for copper. Speaking picturesquely, the surface he found 
from anomalous skin effect measurements possesses “‘arms”’ or “necks”’, 
which extend outward from the Fermi surface, that actually contact the 
hexagonal Brillouin zone faces associated with the <111> directions, i.e., 
the [111] and equivalent directions. Each area of contact is about 10 
per cent of the area of one of the hexagonal zone faces. The Fermi sur- 
face of copper therefore consists of a sequence of sphere-like **bodies”’ 
situated on a body-centered lattice that are completely interconnected 
by “‘arms”’ which lie along <111> directions. 

An “arm” along one of the <111> directions supports open orbits whose 
net direction is along the same <111> direction. However, in copper (and 
surely in various other metals) the shape of the “bodies” and ‘tarms”’ 
is such as to provide support for open orbits along additional directions. 
The origin of the support for additional open orbits is straightforward 
and is discussed elsewhere.’ Resistivity measurements bear out in detail 
the presence of the primary open orbits along <111> directions, as well 


as additional open orbits that the <111> “arms” support,’ principally in 
the <100> and <110> directions. 


Fig. 5 Contours on the Pippard Fermi surface in the ?, = 0 plane transverse 
to the {001| axis. The topological path of investigation proceeded along the dotted 
line and around the contours in the direction of the arrows. This plane was deter 
mined to be electron-like since at the initial point EK « y 
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The periodic energy surface determined by Pippard has been expressed 


. ‘ . ° 9 . . 
to an accuracy of 1 per cent by Garcia-Moliner’ in terms of a three- 


parameter Fourier series. His analytical expression has been employed 
in caleulations on IBM 704 and 7090 computers to study the Hall 
constant for various directions of the magnetic field, based on the pro- 
cedure developed in this paper. In these calculations, electron-like, hole- 
like, and open orbits were all encountered. Some of the contours that 
are present when the magnetic field is along either a <100>, <110>, or 
<111> type. direction are shown in Figs. 5 through 9. In these figures 
the path of investigation is explicitly exhibited by which the relevant 
area was determined according to the procedure indicated in Fig. 4: 
The dotted line denotes that part of the path, always proceeding to the 
right, for which contours are being sought. The heavy line or lines repre- 
sent the part of the path on which, in the direction indicated by the 
arrows, a contour on the Fermi surface is being followed. 

Figs. 5 through 7 represent the behavior in three parallel (O01) planes, 


[100] ee 


Fig. 6 — An open contour on the Pippard Fermi surface in the P, = 0.585 {if 
center to (OOL) face of Brillouin zone 1] plane transverse to the [O01] axis. 
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Fig. 7 — Contours on the Pippard Fermi surface in the P, = 0.575 plane trans 


verse to the {OOL| axis. This plane was determined to be hole-like since at the ini 
tial point FE < EF 


the first at P 0, which shows the contours of electron-like behavior 
around the “belly”? of the Fermi surface. Fig. 6 shows an open orbit, 
ideally belonging to a set of measure zero, located on the (OOL) plane at 
P 0.585, where for present purposes the distance to the (OOL) face of 
the Brillouin zone boundary is taken as unity. Just a short distance be- 
low this plane, at P? 0.575, the carrier behavior is quite different, be- 
ing hole-like; the Fermi surface contours for this plane are shown in 
Fig. 7. 

In Fig. 8, hole-like contours are illustrated that lie in the P 0 


plane when the magnetic field is along the [O11] axis. Oscillations in gold 


and silver of similar so-called ‘“‘\dog-bone”’ orbits have been observed in 
de Haas-van Alphen experiments. — 


Fig. & Contours (‘‘dog-bone’’ orbits) on the Pippard Fermi surface in the 
P, =0 plane transverse to the /O11] axis. This plane was determined to be hole 
like since at the initial point EF < F 
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Fig. 9 — Contours on the Pippard Fermi surface in the P, = 0.866 plane trans 
verse to the [111] axis, which is tangent to one of the hexagonal faces of the Bril 
louin zone. This plane was determined to be hole-like, since at the initial point 


lig. 9 shows the contours found when the magnetic field was in the 
(111] direction and the particular plane of investigation lay on the sur- 
face of the Brillouin zone, i.e., P I/3 = 0.866, expressed in the 
same units as above. The small contour represents a cross section of one 
of the “necks”? which connects the sphere-like “bodies.” However, the 
large hole-like contour, which surrounds the small one, determines the 
behavior in the plane for purposes of the Hall constant calculation. An 
investigation of the contours that terminated before the first big con- 
tour had been reached would have given an erroneous result, although 
it would have covered all of the area within the first Brillouin zone. 

In the planes transverse to either one of the two types of symmetry 
directions, <100> or <111>, closed orbits of both eleetron- and hole-like 
character exist in different planes, this being possible because, by rota- 
tional symmetry, only a finite (hence negligible) number of intermediate 
planes can occur that contain open orbits. This is not the case for copper 
if the magnetic field lies along a <110> direction, and the analysis of this 
particular direction requires an individual study, which will be given 
presently. For the former two symmetry directions the predicted and 
observed" values of the geometrical factor G— divided by the free- 
electron value G are tabulated along with the Hall constant Ry in 
Table I. The ratio GG, measures the effective number of metallic elec- 
trons that are contributed per atom. The observed Hall voltage is very 
sensitive to field orientation in the neighborhood of the <111> direction," 
and accounts for the increased experimental uncertainty. This extreme 
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sensitivity with orientation can be readily understood: It is due to the 
rapid transformation of hole-like regions similar to that illustrated in 
Fig. 9 into electron-like regions as the magnetic field orientation deviates 
from a <111> axis. 

When the magnetic field is in, say, the [O11] direction the Fermi sur- 
face of copper supports not only planes of hole-like or electron-like be- 
havior separated by a finite number of planes with open orbits, but also 
a small yet nontrivial band of planes that support [100|-directed open 
orbits. In principle, these open orbits are broken into very large but 
nevertheless closed orbits if the magnetic field is reoriented very near 
to the [011] direction with a slight component along the [100] direction. 
If the angle of deviation from the [011] direction is small, then only an 
an extremely small number of any new open orbits could have been in- 
troduced. Consequently, it appears quite permissible to treat the Hall 
constant in the immediate neighborhood of a <110> direction as numer- 
ically significant 7f the magnetoresistance exhibits a point of saturation 
in the proximity of the <110> axis, which indicates that at some point 
open orbits were actually absent experimentally, and if the measured 
Hall voltage is a smooth function of the magnetic field orientation in 
that vicinity. The experimental results support both of the requirements 
in question: The magnetoresistance shows a point of saturation near the 
<110> axes,* and the Hall voltage shows a broad flat peak in the neigh- 
borhood of the <110> directions.*:!! Thus this Hall measurement is signifi- 
cant, and it was used to obtain the observed Hall constant for the <110> 
directions shown in Table I. The theoretically calculated value for Table 
I was computed by assuming that the magnetic field was along the 
(O11 
from the hole-like regions. The band of planes with open orbits, which 


| axis, so as to determine the principal contribution, including that 
was actually detected, was then reinterpreted as a band of planes hav- 
ing electron-like behavior, the behavior each of these planes would ac- 
quire if the magnetic field were tilted ever so slightly in the [100] direc- 
tion. 

The precision of the various Hall constant predictions is estimated at 
| to 2 per cent. Better agreement with the experimental Hall constants 
can be obtained if a Fermi surface model is used whose ‘tarms”’ or “‘necks”’ 
are increased in diameter. With fatter “arms,’? more hole-like planes, 
such as in Figs. 7 and 8, will be present for fields along <100> and <110> 


axes, and, as a consequence, higher Hall constants would be predicted. 


0 


a B : I 
It is interesting that the results of de Haas-van Alphen measurements 


* See Ref. 7, especially Fig. 1b, which shows the saturation behavior near to an 
{O11} axis. 
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Taste I—A Comparison BETWEEN THE PREDICTED AND OBSERVED 
HALL CONSTANT FOR THREE INEQUIVALENT FIELD 
ORIENTATIONS IN COPPER 


Predicted* Observedt 
Magnetic Field 
Directions 
Rut G/ Gere. 


<100> ‘ 10.3 0.59 + 
<110> By 10.2 0.65 + 
<111> mY 22.7 0.37 20.0 + 


* Based on the Garcia-Moliner® expression of the Pippard Fermi surface of 
copper. 

+ Based on the data of Kunzler and Klauder."! 

t The numbers in this column when multiplied by 10°'? represent the Hall 
constant expressed in volt-em/abamp gauss 


also suggest that a more realistic copper Fermi surface would have fatter 
“arms” than the Pippard model. 


ACKNOWLEDGMENTS 


It is a pleasure to thank C. Herring for bringing this problem to my 


attention and for his interest in this work. Many valuable conversations 
with J. EK. Kunzler are greatly appreciated, and a critical reading of the 
manuscript by E. I. Blount is acknowledged. The author wishes to thank 
Miss B. Cetlin for her advice and encouragement. 


REFERENCES 


1. Pippard, A. B., Phil. Trans. Roy. Soc., A250, 1957, p. 325. 
2. Lifshitz, I., Azbel, M., and Kaganov, M., J. Exptl. Theoret. Phys. (U.S.S.R.), 
$1, 1956, p. 31; Soviet Phys. J.E.T.P., 4, 1957, p. 41. 
3. Lifshitz, I., and Peschanskii, V., J. Exptl. Theoret. Phys. (U.S.S.R.), 35, 
1958, p. 1251; Soviet Phys. J..T.P., 8, 1959, p. S75ff. 
Kunzler, J. f., and Klauder, J. R., Bull. Am. Phys. Soc., 5, 1960, p. 150 
. Alekseevskii, N., and Gaidukov, Y., J. Exptl. Theoret. Phys. (U.S.S.R.), 36, 
1959, p. 447; Soviet Phys. J.E.T.P., 9, 1959, p. 311 
Priestley, M. G., Phil. Mag., 5, 1960, p. 111 
Klauder, J. R., and Kunzler, J. ., Proceedings of the International Conference 
on Fermi Surfaces, John Wiley & Sons, New York, 1961 
8. Klauder, J. R., and Kunzler, J. E., J. Phys. Chem. Solids, 18, 1961 p. 256. 
9. Gareia-Moliner, F., Phil. Mag., 3, 1958, p. 207 
10. Schoenberg, D., Phil. Mag., 5, 1960, p. 105. 
11. Kunzler, J. E., and Klauder, J. R., Phil. Mag., in press 








The Resistance of Organic Materials 


to Attack by Marine Bacteria 
at Low Tempe ‘atures 


By PRISCILLA L. STEINBERG 


(Manuscript received April 13, 1961) 


The use of various organic materials in submarine cable structures has 
brought about an interest in the anticipated behavior of such materials in a 
sea water exposure. Organic materials have been exposed in the laboratory 
at atmospheric pressure to marine bacteria at both 20°C and 5°C under 
aerobic conditions and 20°C under anaerobic conditions. The utilization of 
the test materials by the bacteria has been determined by either measure- 
ments of oxygen consumption or hydrogen sulfide production. 

None of the polyethylenes, synthetic fibers, or other thermoplastics such 
as polytetrafluoroethylene, polycarbonate, or polyamide was utilized by 
either aerobic or anaerobic microorganisms. All the elastomers tested were 
utilized by aerobic bacteria, but a majority of them were resistant to attack 
by anaerobic bacteria. Jute fibers served as a carbon source for both aerobic 
and anaerobic bacteria. In general, the poly(vinyl chloride) materials plas- 
ticized with an external plasticizer served as a source of energy for aerobic 
microorganisms at both temperatures, and six of these were also utilized 
by anaerobic bacteria. It appears that the added external plasticizer was 
responsible for the attack, since further tests showed the pure resin and the 
internally plasticized poly(vinyl chloride) to be resistant to both types of 
bacteria. 

Comparison of the data from 20°C and 5°C aerobic tests indicates greater 
microbial activity at the warmer temperature. Calculated rates of utilization 
of the susceptible materials showed that the elastomeric compounds were 
generally consumed ten times as fast when incubated at 20°C as at 5°C. 
The poly(vinyl chloride) compounds when compared similarly showed ap- 
proximately a five-to-one ratio, while jute fibers were utilized twice as fast 
at 20°C than at 45°C. 


1369 
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I. INTRODUCTION 


The interest of the Bell System in the extent of marine bacterial ac- 
tivity at great ocean depths stems from the knowledge that certain or- 
ganic materials are susceptible to attack by such organisms. For ex- 
ample, studies made by ZoBell and Beckwith! on rubber products and 
by ZoBell on hydrocarbons have indicated that marine bacteria are 
capable of utilizing these materials. Therefore, one phase of a biological 
testing program (see Fig. 1) initiated by the Laboratories in 1954 has 
been designed to determine the resistance of organic materials that were 
possibly applicable in ocean cable construction to attack by both aerobic 
and anaerobic marine bacteria. The first data collected from these lab- 
oratory tests have been reported earlier by Snoke,* and showed that in 
many instances specific organic materials were capable of being utilized 
by marine bacteria at 20°C, Attack was predominately by aerobes, little 
attack having occurred due to sulfate-reducing anaerobes. 

The 20°C temperature is representative of certain shallow water en- 
vironments and is in the range which supports high microbial activity. 
Ocean bottom temperatures at depths greater than 1000 meters range 
from 5° to —1.5°C.4 The present work, concerned with deep water tem- 
peratures, involves the testing of the resistance of the materials to aerobic 
bacteria at an incubation temperature of 5°C and the comparison of the 
results with those at 20°C. It was assumed initially that the rate of ac- 
tivity would be somewhat lower at 5°C, and since, in most cases, there 
had been little attack by anaerobic organisms at 20°C, no 5°C anaerobic 
tests were conducted. 


BIOLOGICAL TEST PROGRAM 
OF MATERIALS 


] | 











| 


LABORATORY TESTS } MARINE EXPOSURE i 








ANALYTICAL CONDUCTOR TEST | MARINE BORER TESTS ANALYSIS OF | 
TESTS CABLE | 


MEASURE MEASURE | WRIGHTSVILLE | DAYTONA FROM SERVICE| 
RESPIRATION OR BREAKDOWN BEACH, N.C. | BEACH,FLA 
BY-PRODUCTS OF OF DIELECTRIC L +4 —________ 

MICROBIAL 

ATTACK (MONTHS OR YEARS) 


-_——___ - — 


Fig. 1 Outline of marine biological test program 
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Il. BACKGROUND 

Marine bacteria are found in varying quantities over the entire ocean, 
with the greatest concentration being in coastal waters and in the region 
of the sediment-water interface. In spite of their minute size (generally 


less than one micron in length and one micron in diameter) they are so 


plentiful that the estimated mass of bacteria in the oceans has been 
placed at ten million tons.° The concentration at a particular time at any 
location reflects the existing balance between productive and destruc- 
tive factors. The numbers of bacteria are predominately due to the 
amount of organic matter present, since this provides the food source 
for the bacteria. An example of the influence of the quantity of organic 
matter on the bacterial population can be seen in the studies made by 
Hesse on water from the junction of the Atlantic Ocean and the colder 
Arctie Ocean.® The change in temperature encountered here kills large 
quantities of marine organisms, thus providing an increase in the or- 
ganic food supply, and has resulted in a great increase in the number 
of bacteria. 

Other environmental factors may also have some influence on the bac- 
terial population through effects on the supply of organic matter. It has 
been noted that a direct relationship exists between the bacteria and the 
diatoms, unicellular marine plants. These plants are capable of synthe- 
sizing organic matter from the simple chemical substances produced 
during the bacterial decomposition of extraneous organic material. 
Therefore, in areas of abundant diatoms there may also be an abnormally 
high number of marine bacteria. One area with a characteristically high 
population of marine bacteria is the mud-water interface. The popula- 
tion increase here is due to the rich food supply caused by constantly 
sinking marine plant and animal remains. Of this utilizable organic mat- 
ter, Waksman and Carey found that approximately 60 per cent con- 
sumed by the bacteria is oxidized to carbon dioxide and water, and that 
the remainder is used to build new bacterial cells.7 

Marine bacteria have been found in bottom deposits at depths of 
10,280 meters below sea level, where the hydrostatic pressure is approxi- 
mately 1000 atmospheres.* Cattell’ indicates that single-cell microorgan- 
isms can tolerate pressures up to 6000 atmospheres. Several species of 
bacteria studied by Sanborn!’ were active at a temperature of —5°C, 
which is slightly lower than typical bottom temperatures in the deep 
ocean. However, in spite of the ability of marine bacteria to adapt to 
different conditions, there are several factors that do restrict their num- 
ber in the ocean. First, the amount of organic matter in the open sea is 
limited, and of this supply a significant part cannot be utilized by the 
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bacteria. Second, bacteria are a major part of the food supply consumed 
by protozoa and other small marine animals. Third, as the rate of bac- 
terial multiplication increases, a toxic substance injurious to their de- 
velopment may begin to form and thus partly inhibit further growth. 

Temperature seems to influence the bacterial environment, mainly as a 
factor controlling the extent of activity, but with little effect on gross 
population figures. Studies have indicated that bacteria found in the cold- 
est waters multiply more slowly but live longer. ZoBell* has also found 
that this relationship holds true for sea water stored in glass receptacles 
at incubation temperatures ranging from 0° to 25°C. Temperature may 
restrict the number of species of bacteria present in a given location, but 
the only time it becomes a factor influencing the total bacterial popula- 
tion is prior to the establishment of an equilibrium within the system. 

This brief outline of marine bacteria illustrates the complexities facing 
the experimenter who is interested in trying to simulate their ecology. 
Since there are so many variables, any laboratory tests will necessarily 
be limited by the particular set of conditions satisfied during incubation 
of the test bottles. The tests reported here were conducted under labora- 
tory conditions using aged and filtered sea water, laboratory-incubated 
enrichment cultures, and small test samples. Due to the great number 
of samples, all could not be tested at one time. Also, any chemical 
changes which might occur in the materials due to prolonged exposure 
were not reflected. The data collected in these tests do provide immedi- 
ate information on the relative performances of the materials under the 
test conditions, and can be used as a gauge in judging materials until 
natural exposure data are available. 

The first results from early inspections of samples exposed directly 
in the ocean were reported by Snoke;’ more recent data will be made 


available in a future paper. 


Ill. DESCRIPTION OF LABORATORY TESTS 


The two types of laboratory tests conducted are designed to test the 
resistance of the various organic materials to both aerobic and sulfate- 
reducing bacteria. The latter test, since sulfate-reducing bacteria are 
ubiquitous, gives an indication of the extent of anaerobic attack. In both 
cases, the procedure consists in exposing the test materials to bacteria 
which must utilize the material as a carbon source if they are to live. 

Whenever possible, the materials to be tested are obtained in the form 
of film about 4 mils thick. Each bottle contains enough sample to ex- 


pose a surface area of 12.9 square centimeters, which, in the case of film, 


is a l-inch-square sample. The elastomeric materials are approximately 
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AGED OCEAN BOTTOM RAW TEST MATERIAL 
SEA WATER SEDIMENT SEA WATER (FILM) 








(a) SATURATE WITH 
FREE OXYGEN (AEROBIC) 
OR 


(b) REMOVE ALL ENRICHMENT 


FREE OXYGEN (ANAEROBIC) CULTURE 
(INOCULUM) 


=>} a 


TEST BOTTLE 




















Fig. 2 Flow chart of laboratory bacteriological test. 
75 mils thick and are therefore cut in }g-inch-square samples. It is diffi- 
cult to measure the surface area of granular samples and, as a result a 
selected mass, namely 0.05 gram, is used for all such samples or for jute 
fiber. The sample size for the other fibers, namely acrylonitrile-viny] 
chloride copolymer and 6-6 polyamide (Source Ib)*, was determined by 
an approximation of the desired 12.9 square centimeter surface area. 


3.1 Aerobic Bacteria 


The biochemical oxygen demand (BOD) type of test measures the 
amount of dissolved oxygen consumed by aerobic bacteria during an 
eight-week incubation of the organic materials in aged and inoculated 
sea water. Since the test samples supply the only source of carbon avail- 
able to the bacteria, the rate of oxygen consumption, which is a meas- 
ure of the respiratory process of the living system, becomes a direct 
measure of the utilization of the material. The flow chart representing 
the steps involved in the test preparation is shown in Fig. 2. 

At least eight weeks prior to running the test, sea water and marine 
sediment are collected from an area on the Atlantic coast. Some of the 
material to be tested is added to a portion of the collected sea water 
and sediment, and the mixture is inoculated from pooled marine en- 


* Indicates source of supply. 
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richment cultures obtained through the courtesy of C. E. ZoBell of 
Scripps Institute of Oceanography. These latter were prepared using a 
wide range of organic materials including elastomers, carbohydrates, 
coal products, and other aromatic compounds. The enrichment cultures 
made up for each test sample are incubated at 25°C for at least six weeks 
prior to testing. The purpose of preparing separate cultures is to enable 
bacteria capable of utilizing the materials to develop preferentially 
under ideal conditions. In the meantime, the remainder of the raw se: 
water is filtered and aged in the dark until the utilizable organic content 
is about one part per million or less. 

When these preliminary procedures have been carried out, the aged 
sea water is placed in a carboy and aerated with oxygen for 16 hours. 
Prior to filling the test bottles, a combined enrichment culture contain- 
ing a few drops from each of the incubated cultures of the sample group 
being tested is added to the aerated sea water in a concentration of 1 
milliliter per 10 liters of sea water. This concentration of inoculum pro- 
vides maximum microbial activity with the least amount of added or- 
ganic material. A slight positive oxygen pressure is placed on the water 
in the carboy until after the test bottles are filled, thus assuring oxygen 
saturation throughout the period preceding incubation. 

Ten replicates of each sample are incubated in 60-milliliter glass- 
stoppered bottles in a constant-temperature water bath. Normally, two 
replicate samples are sacrificed for measuring the dissolved oxygen of the 
water following 0, 1, 2, 4, and 8 weeks of incubation. However, when a 
high rate of attack is anticipated the incubation intervals are shortened 
in order to obtain significant titration data before all the available oxy- 
gen has been utilized by the bacteria. Usually six materials are included 
in each test run with one set of control bottles. The 10 control bottles 


included contain inoculated sea water and are prepared exactly the 


same as the test bottles except for the absence of test material. 

The dissolved oxygen content of the test bottles before incubation 
and following the different incubation periods is measured using a mod- 
ified Winkler method. The basic Winkler procedure consists of the three 
steps shown in Fig. 3: (1) oxidation of manganous hydroxide in a highly 
alkaline solution, (2) acidification of the product in the presence of an 
iodide, and (3) volumetric titration of the liberated iodine with a stand- 
ard thiosulfate solution using a starch indicator. 

The oxygen consumed is calculated in parts per million from the 
results of the titrations of the sea water prior to incubation and follow- 
ing the longest period of incubation. The difference between these two 
values over and above any loss of oxygen in the control samples con- 
stitutes the biochemical oxygen demand (BOD) with the test material. 





RESISTANCE OF ORGANIC MATERIALS 


WINKLER OXYGEN DETERMINATION IN BOD TEST 


(a) OXIDATION OF MANGANOUS HYDROXIDE IN A HIGHLY ALKALINE SOLUTION 
2H20 + O2 + 2MN(OH)2 —* 2Mn(Oh), 


(b) ACIDIFICATION IN THE PRESENCE OF AN IODIDE 
Mn(OH), + 4HCL —® MNCl2 + Cla + 4H20 


Cle + 2Nal —* 2NaCl +I, 


(C) FREE IODINE TITRATED WITH A STANDARD THIOSULFATE SOLUTION 


T2 + 2Na2S203 vem Na2S40¢ + 2NalI 
STARCH INDICATOR STARCH INDICATOR 
(BLUE) (COLORLESS) 


HYDROGEN SULFIDE DETERMINATION IN ANAEROBIC TEST 


HoS + Io — > 2HI +S 
(STARCH INDICATOR REMAINS BLUE FOLLOWING COMPLETION OF REACTION) 


Fig. 3 — Reactions involved in laboratory marine analytical tests. 


3.2 Anaerobic Bacteria 


In the anaerobic test procedure the volume of hydrogen sulfide pro- 
duced as a by-product during the utilization of the organic materials by 
anaerobic sulfate-reducing bacteria is measured as an indication of 
anaerobic attack. 


Except for the handling of the aged sea water, the preparations for 
the anaerobic test are quite similar to those for the BOD test (see Fig. 
2). In this test the sea water is boiled vigorously for 10 minutes and 
siphoned immediately into a carboy. A slight positive nitrogen pressure 
is placed on the carboy until the water is cooled enough to add the in- 
oculum. Additional nitrogen pressure is kept on the carboy while the 
bottles are filled in order to prevent free exchange of oxygen with the 
water. Since the anaerobes function much more slowly than the aerobic 
bacteria the incubation periods selected are 0, 4, 8, 12, and 16 weeks in 
the water bath at 20°C. 

Following incubation the hydrogen sulfide dissolved in the sea water 
is titrated with an iodine solution using starch as an indicator (see Fig. 
3). Hydrogen sulfide content calculated over and above that found in 
the inoculated controls is considered a measure of anaerobic attack of 
the material in test. 
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IV. MATERIALS TESTED 


The individual compounds which have been tested against aerobic 


and anaerobic organisms are listed in Table I. They included elastomers, 


poly(vinyl chloride), polyethylene, polypropylene, other thermoplastics, 
natural and synthetic fibers, and casting compounds. 


.1 Elastomers 


With the exception of silicone rubber, all rubber samples were com- 
pounded at Bell Telephone Laboratories, steam cured, and supplied in 
sheet form approximately 75 mils thick. Although a few materials were 
submitted as both jackets and insulation the composition of the two 
varied considerably in some cases, and the two compounds behaved 
quite differently in test. 

According to ZoBell and Beckwith,'! even a highly purified nonvul- 
canized rubber sample can be utilized by marine bacteria. The presence 
of numerous waxes, oils, soaps, stearic acid, and phenolic compounds 
used in the compounding of the elastomers tends to increase the suscep- 
tibility of the rubber compounds to microbial attack. To date, no work 
has been done to determine which components in the elastomers were 
primarily responsible for the extent of attack experienced in the lab- 
oratory. 


1.2 Poly( Vinyl Chloride) 


All poly(vinyl chloride) samples submitted in sheet form were ap- 
proximately 2 to 5 mils thick. All of the known plasticizers in the ex- 
ternally plasticized poly(vinyl chloride) samples contained typical or- 
gano-metallic-type stabilizers (such as barium, cadmium, and lead), 
fatty acid lubricants in low concentrations, and, in some cases, small 
quantities of inorganic fillers. 

The unplasticized, rigid poly(vinyl chloride), although not containing 
a specific external plasticizer, was compounded with a significant amount 
of fatty acid lubricant and is therefore compared throughout the text 
with the externally plasticized compounds. This compound contained 
approximately eight to ten times as much fatty acid as was used in com- 
pounding the two internally plasticized acrylate copolymers designated 
semiflexible copolymers A and B. The rigid poly(vinyl chloride) should 
not be confused with poly(vinyl chloride) resin also discussed here, 
which was a pure poly(vinyl chloride) powder containing no added 


plasticizer or lubricant. Polyester A, C, and FE plasticizers were fatty 
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TABLE | 


MATERIALS 


MATERIALS TrESTED AGAINST AEROBIC AND 


ANAEROBIC MARINE BACTERIA 


Material 


Elastomers 


GR-S jacket 

Natural rubber jacket 

Butyl rubber jacket 

Neoprene red lead jacket (with clay) 
Neoprene red lead jacket (without clay 
Natural rubber insulation 

GR-S insulation 

Silicone rubber 

GR-A jacket 

Neoprene (ZnO, MgO jacket) 


Poly(vinyl chloride 


Polyester E (BTL 46-55) 

Polyester A 

Di-2-ethylhexyl phthalate (DOP), Source 
None rigid 

Tri-2-ethylhexyl phosphate (TOP 

Nitrile rubber/polyester C 

Nitrile rubber 

Tricresyl phosphate (TCP) — sheet 
Di-2-ethylhexyl phthalate (DOP), Source : 
Proprietary compound 

1)i-2-ethvlhexyl phthalate (DOP), Source :‘ 
Tricresy! phosphate (TCP) — granular 
Di-2-ethylhexyl phthalate (DOP), Source 
None (semiflexible copolymer A) 

None (semiflexible copolymer B) 

None (PVC resin) 


sheet 


granular 


Polyethylene 


0.2 melt index + antioxidant sheet 

0.2 melt index + antioxidant + 5°) butyl rubber-sheet 

0.2 melt index (Source B) sheet 

2.0 melt index (ASTM 11238) granular 

0.2 melt index (Source A) granular 

0.2 melt index (Source B) granular 

0.2 melt index + antioxidant granular 

0.2 melt index + antioxidant + 5° butyl rubber 
granular 

0.7 melt index (high density) natural + antioxidant 
sheet 

0.7 melt index (high density) + carbon black + antioxi 
dant sheet 


Polypropylene 


Polypropylene + antioxidant A 
Polypropylene + antioxidant B 


Chemical Department 
Designation 


moro 
i ~ 
WO 


i) 


or 
oe te ee 


Vor 
—_— 


plasticizer used 


P5503115 
BTL 24-54 
P5606412 
P5503087 and P5402049 
P5502081 
P5502074 
P5502076 
P5311580 
P5511691 
P5511690 
P5502082 
BTL 529-53 
BTL 23-54 
P5502078 
P5502077 
P5510645 


P5308390 
P5602065 and P5403122 
P5312587 
P5310466 
P5304156 
P5312587 
P5308390 
P5308396 


P5512792 
P5503133 
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TABLE I—COoNTINUED 


ee Chemical I epartme nt 
Designation 


Other thermoplastics 


Cellulose acetate 

Cellulose triacetate 

6-6 polyamide (Source Ia) 
6 polyamide (Source IT) 
6-10 polyamide (Source III 
Polycarbonate 

Polyether 

Polyethylene terephthalate 
Polytetrafluoroethylene 
Trithene film 


Fibers 


Aecrylonitrile-vinyl chloride copolymer 
6-6 polyamide (Source Ib 


Casting compounds 


Kk poxide LRM 392, L2 (sheet) 
Styrene-polyester, silica filled LRM 326, Li 


(sheet ) 


acid compounds with polyester C having been compounded with nitrile 
rubber for use as a test material. 

Two of the compounds containing di-2-ethylhexyl phthalate (DOP) 
as the external plasticizer are labeled Source 1 and Source 2, and may 
be differentiated by the Shore A hardness test.* Source 1 had a Shore 
A figure of 88, indicating that it contained more plasticizer than Source 
2, which had a Shore A figure of 62. The concentration of di-2-ethyl- 
hexyl! phthalate in Source 3 is not known. 


1.3 Polyethylene 


The 0.2 melt index (Source A) and 2.0 melt index samples were pre- 
pared only in granular form. Data for some of the other polyethylenes 
were tabulated for both sheet and granular forms, but comparisons were 
only made when both series of tests involved the use of a similar form 
of material. 


1.4 Polypropylene 


Both polypropylene samples were submitted in sheet form about 15 
mils thick. One sample contained antioxidant A, which included a phe- 
nolic compound, and the other contained an unknown antioxidant (B). 


* See ASTM [D-1706-59 T 
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4.5 Other Thermoplastics 


Most of the other thermoplastics tested were supplied as films between 
1 and 5 mils thick. Of the polyamide samples tested, Source Ia is a 6-6 
polyamide, Source II is a 6 polyamide, and Source III is a 6-10 poly- 


amide. The cellulose acetate sample contained 54.5 per cent acetic acid, 
while the cellulose triacetate contained 61.5 per cent acetic acid. 


1.6 Fibers 


The jute used was an untreated roving. Acrylonitrile-viny! chloride 
copolymer and 6-6 polyamide (Source Ib) were both supplied as yarn. 
Since both of these latter materials had been subjected to spinning oils 
during manufacture, they were exposed in test both as unwashed and 
washed samples. Petroleum ether was used as a wash medium. 


1.7 Casting Compounds 


The granulated epoxide compound was a casting resin set up through 
the use of an epoxy hardener. The sheet compound was a silica filled 
epoxy resin which had been pigmented with titanium oxide and hard- 
ened with an amine hardener. The resulting compound was quite brittle 
and about 13 mils thick. 

The granulated styrene-polyester compound was also silica filled. The 
similar sheet material provided was filled with silica (89.7 parts) and 
glass fiber (5 parts) and pigmented white with TiO, . This compound 
Was approximately 10 mils thick and was very brittle. 


V. TEST RESULTS 


In discussing the results of this test program the word “‘attack’’ is 
used quite freely when describing the utilization of organic materials by 
the bacteria. The word ‘tattack”’ is used here to indicate that a material 
can serve as a source of food for marine bacteria rather than to indicate 
that it suffers rapid visible deterioration. In natural exposure this 
“attacked” material might perform quite satisfactorily for many years; 
in fact, many similar materials have been used successfully in marine 
environments over the years. However, the accelerated test data are 
valuable in screening various materials as to their comparative expected 
performances. Such data are of value in the selection of materials for 
submarine applications until long-range tests have been in operation 
for a considerable length of time. Also, when the natural exposure data 
do become available they can be correlated with the laboratory figures 
to provide additional information. 
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Figs. 4 and 5 show the extent of attack of the elastomers and the 
externally plasticized poly(vinyl chloride) samples as functions of time. 
They are based on data collected from the Winkler titrations, which 
measure the dissolved oxygen content of the sea water after the various 
incubation intervals. Each individual point on the curve was calculated 
as follows: 


B, 


xX 100 — C per cent of oxygen consumed 
during particular incubation interval, 


where 


A = dissolved oxygen (ppm) present in test bottles prior to incuba- 
tion (average of two replicate samples), 

B, = dissolved oxygen (ppm) present in test bottles following the 
incubation period under consideration (average of two replicate sam- 
ples), 

( = per cent oxygen loss in the control or inoculated sea water bottles 
during the same incubation interval (2). 


Whenever the available supply of dissolved oxygen was utilized en- 
tirely, no control factor (C) was used, and the result was reported as 
100 per cent consumption. In general, the loss of oxygen in the control 
bottles during eight weeks’ incubation at 20°C was between 10 and 15 
per cent of the total dissolved oxygen present. In the tests run at an in- 
cubation temperature of 5°C, 5 to 10 per cent of the total oxygen was 
usually lost during the same interval. 

In a few instances of low oxygen consumption the two-week titration 
figure showed more dissolved oxygen remaining than at the end of one 
week of incubation. Such anomalies may result from the fact that either 
(a) the control bottles showed more loss of oxygen than usual after two 
weeks of incubation or less than usual in the bottles titrated following 
one week, (b) the systems were not exact replicates, or (c) there was a 
small variation in the initial supply of oxygen. In such instances the 


higher oxygen consumption value was entered for both test periods to 
reflect the higher rate of attack. 


To permit a ready comparison of the attack of different materials by 
aerobic bacteria, an “index of oxygen consumption” was calculated for 
each material. This index is the average daily consumption for the in- 
cubation period up to 100 per cent consumption or to the longest test 
interval (eight weeks), whichever is shorter. Thus the units of the index 
are properly ‘‘per cent oxygen consumed by the bacteria per day.”’ It is 
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Fig. 5 Individual biochemical oxygen demand (BOD) test results on poly 
(vinyl chloride) plastics; samples are designated by plasticizer used 
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preferable, however, to disregard the units in dealing with the index, 
since the consumption rate is seldom linear; hence the index cannot be 
used to calculate oxygen consumption reliably. Its value is in providing 
a single figure which qualitatively reflects relative rates of attack while 
the curves of Figs. 4 and 5 provide more detailed comparisons. 

The indices of oxygen consumption figures for all test materials are 
listed in Table II. In the interpretation of these data, a material was 
arbitrarily judged a potential food source for the marine bacteria if the 
index of oxygen consumption exceeds 0.5 per cent per day in the 20°C 
tests. Such materials are indicated by asterisks. Also, if BOD test results 
showed a system to be capable of utilizing more than 0.1 per cent of the 
available oxygen per day in the 5°C tests, the quality of the performance 
of this material was questioned. 

The comparison of the indices calculated for the individual materials 
at 5° and 20°C has been made by a general grouping of the materials 
according to their rates of utilization, and may be found in Table III. 

A list of materials that were not attacked in aerobic or anaerobic tests 
may be found in Table IV. 


VI. DISCUSSION OF TEST RESULTS 


Curves drawn for all of the externally plasticized poly(vinyl chloride) 
samples and the elastomers (ligs. 4 and 5) suggest that there are three 
general patterns of oxygen consumption by the marine bacteria. Essenti- 
ally these are as follows: Type A — oxygen consumption rate follows a 
linear pattern as a function of time; Type B — oxygen consumption rate 


increases as the length of incubation time increases; and Type C — oxy- 


gen consumption rate decreases as the length of incubation time in- 
creases. In all cases involving the BOD testing of elastomeric materials 
at 20°C the oxygen was consumed in a manner illustrated by the Type 
A slope, in which the rate of oxygen consumption appears to be a linear 
function dependent on the length of incubation time. However, in most 
of these tests the total available oxygen was utilized so quickly that 
very little titration data were collected, and it was rather difficult to 
generalize concerning the manner of attack of the materials. In many 
cases only two points could be considered, since the exact time at which 
100 per cent of oxygen had been consumed was not known. When the 
same materials were incubated at 5° only neoprene (containing zine 
oxide and magnesium oxide) jacket was attacked at a rate directly pro- 
portional to time. GR-A jacket was utilized more readily at the begin- 
ning of the test, and the remainder of the elastomers were utilized more 
easily as the incubation time increased. 
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TABLE III—ComPpaRISON oF 20° AND 5°C BOD Test Resu.tts 
BAsED ON INDEX OF OXYGEN CONSUMPTION 


Per cent oxygen consumed per day 
Material tested 


20°C test 5°C test 


Elastomers 


GR-S jacket 

Natural rubber jacket 10-20% 
Neoprene red lead jacket (with clay) 

Neoprene red lead jacket (without clay) 

Natural rubber insulation 

GR-S insulation 


GR-A jacket under 10% under 1% 
Neoprene (ZnO, MgO jacket) 


Butyl rubber jacket 10-20% under 1% 
Silicone rubber under 10% 

Jute 
Untreated roving 

Poly(vinyl chloride), designated by plasticizer 


Polyester E over 10% over 2% 
Polyester A 


Di-2-ethylhexyl phthalate 2.5-10% 
(DOP) Source 1 
None (rigid) 


Tri-2-ethylhexyl phosphate (TOP) 
Nitrile rubber/polyester C .§-2.! 0.1-0.5% 
Nitrile rubber 

Tricresyl phosphate (TCP) 

1i-2-ethylhexyl phthalate (DOP) — Source 3 

Proprietary compound 

Di-2-ethylhexyl phthalate (DOP) — Source 2 


kxamination of the individual curves containing the BOD data for 
the externally plasticized poly(vinyl chloride) plastics shows some sim- 
ilarity to the rates of attack found in the rubber samples. At 20°C nine 
different test materials were utilized essentially in a manner which would 
fall into the straight-line or Type A oxygen-consumption rate curve. 
The two exceptions were the rigid unplasticized poly(vinyl chloride), 
which began to be utilized more slowly as incubation time increased, 
and the poly(vinyl chloride) plasticized with tricresyl phosphate, whieh 
began to be attacked more rapidly as time increased. At the 5°C incu- 
bation temperature only the poly(vinyl chloride) plasticized with poly- 
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TABLE I1V—MarTeriats Not ATrackepD IN EITHER 
BOD or ANAEROBIC TESTS 
Poly(vinyl chloride) 
Semiflexible copolymer A (no plasticizer added) 
Semiflexible copolymer B (no plasticizer added) 


Pure poly(vinyl chloride) resin 


Polyethylene 


melt index, standard molecular weight 

melt index (Source A), high molecular weight 

melt index (Source B), high molecular weight 

melt index + antioxidant 

melt index + 5% butyl rubber + antioxidant 

melt index (high density) + antioxidant 

7 melt index (high density) + carbon black + antioxidant 


“Ibo bo bo bo 


Polypropylene 


Polypropylene + antioxidant A 
Polypropylene + antioxidant B 


Other thermoplastics 


Polyamide (Sources Ia, II, and III) 
Polyethylene terephthalate 
Trithene film 
Polytetrafluoroethylene 

Polvyether 

Cellulose triacetate 

Cellulose acetate 

Polycarbonate 


Casting compounds 


Epoxide casting compound 
Stvrene-polyester, silica filled 


Fibers 
Acrylonitrile-vinyl chloride copolymer 


6-6 polyamide (Source Ib 


ester E was utilized following the Type A pattern, while five samples 


the rigid unplasticized poly(vinyl chloride) and those plasticized with 


? 


polyester A, nitrile rubber, DOP Source 3, and a proprietary plasti- 
cizer — were attacked more slowly with an increase in incubation time. 
The five other poly(vinyl chloride) plasticizers —- DOP Source 1, TOP, 
nitrile rubber/ polyester C, TCP, and DOP Source 2 —— were more easily 
attacked as incubation time increased. 

It is highly possible that the slope variations noted in the individual 
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Type B curves could have resulted from testing under laboratory con- 


ditions. For instance, when a material has been attacked quite slowly 


at the beginning of a test and then more rapidly as the test progresses, 
it may be due to the fact that an equilibrium had to be established within 
the confines of the 60-milliliter test bottle before the bacteria could 
function effectively against the test material. Or perhaps the bacteria 
were unable to attack until a leachable plasticizer or lubricant or other 
compounding material had been released into the sea water in a form 
or amount that would be acceptable to the bacteria. Another factor 
which could be partially responsible for this type of situation was the 
fact that the enrichment cultures which were used for the 5°C tests were 
originally incubated at 25°C; this temperature could have temporarily 
inactivated bacteria, allowing them to become active only after in- 
cubation at 5°C. 

With the Type C curve, laboratory conditions could also be held 
partially responsible for its shape. Possible explanations for a greater 
rate of attack at the beginning rather than at the end of the test might 
be that toxic materials were being released into the sea water, thus con- 
tinually diminishing the rate of attack, or perhaps that the establish- 
ment of an equilibrium brought about a dormant system causing a de- 
crease in microbial activity. This suggests that attack of the same 
materials in a natural marine exposure might follow a different pattern. 

In comparing the curves for the same materials under different incu- 
bation temperatures it was noted that the only materials which showed 
similar patterns of attack at both temperatures were the poly(vinyl 
chloride) samples plasticized with polyester E and TCP, the unplasti- 
cized rigid poly(vinyl chloride), and the elastomer neoprene (containing 
zine oxide and magnesium oxide) jacket. In all cases the rate of utiliza- 
tion was noticeably higher during the 20°C test. 

In the remainder of the testing of rubber compounds the greatest 
difference between the patterns of oxygen consumption at 20° and 5 
was noted during the first two weeks of incubation. The temperature 
differential appeared to be responsible for the length of incubation time 
needed to set up an active microbial system. This situation was also en- 
countered in the testing of the four poly(vinyl chloride) samples plasti- 
cized with DOP Source 1, TOP, nitrile rubber/ polyester C and DOP 
Source 2 at 5°C, in which the oxygen consumption rate increased as 
incubation time increased. These same materials followed the Type A 
pattern of oxygen depletion when incubated at 20°C. 

The remaining four externally plasticized poly(vinyl chloride) plastics 
were attacked essentially in direct proportion to the extent of time in 
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test when incubated at 20°C, but exhibited a higher rate of attack at 
the beginning than at the end when tested at 5°C. 

One other interesting feature was noted when comparing the curves 
of the individual materials at the two incubation temperatures. In a 
few cases, such as with the DOP Source | plasticized poly(vinyl chloride) 
sample, there was a short interval in both curves where the slope had 
leveled off. In the cited example this interval represented the second 
week of the 20°C test and the third and fourth week of the test, when 
the temperature was held at 5°C. 

In comparing the indices of oxygen consumption of the individual 
materials at the two temperatures, the most noticeable feature appears 
to be that the elastomeric materials, when incubated at 20°C, were 
utilized essentially ten times faster than the same materials incubated 
at 5°C. Furthermore, examination of the relationship between BOD 
tests at 20° and 5°C involving the externally plasticized poly(vinyl] 
chloride) samples shows that the ratio of attack was approximately 5 
to 1. The ratio of the rates of attack witnessed in the BOD testing of 
jute was 2 to 1. No other ratios were noted for less utilized material, 
since the indices for both 20° and 5° data were too small to allow any 
accurate comparisions. 

This classification system does not necessarily mean that the test 
data for each material exhibit this type of ratio. For instance, the rela- 
tionship of attack at the two temperatures for polyester E plasticized 
poly(vinyl chloride) would be 14.38 to 5.1, for a ratio of 2.8 to 1 instead 
of 5 to 1, and for natural rubber insulation it would be 7.8 to 1 instead 
of 10 to 1. One extreme case was the BOD results for neoprene (contain- 
ing zinc oxide and magnesium oxide) jacket, which gave the material 
an index of 2.8 per cent per day and a place on the ‘“‘under 10 per cent” 
for 20°C data, while the 5°C index of 0.1 per cent per day correctly placed 
the material on the related ‘‘under | per cent’’ list. However, this pro- 
duced an actual 28.0 to 1 ratio in the rates of attack. This is a broad 
classification of the performance ratios and, although most of the mate- 
rials fall essentially within the assumed ratios, it is not infallible. 

The materials which failed to fall into this group relationship were 
the elastomers butyl jacket and silicone rubber. The ratio of the results 
of the silicone rubber BOD tests were 9.7 to 1.4, thus being fairly close 
to the range of anticipated behavior. The placement difficulty came 


from the choice of 10.0 rather than 9.0 or even 9.7 as a breaking point 


between the two groups, and does not indicate any peculiarity of sample 
performance. In the case of butyl rubber jacket, the BOD tests have been 
rerun in an effort to determine its odd behavior when compared to the 
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remainder of the elastomers. However, in all tests conducted to date 
butyl jacket has not been readily attacked at 5°C. The reason for this 
phenomenon is not fully understood. 

In studying these figures it may be noted that the elastomers were 
utilized by aerobic bacteria more readily than was any other group of 
materials. The total available oxygen was consumed in all of the ten 
BOD tests run on elastomers at 20°C. The time required for complete 
consumption ranged from 7 to 28 days with the indices of oxygen con- 
sumption varying from 19.9 to 2.8 per cent per day. In the 5°C tests 
the elastomers were usually attacked but at a slower rate, and butyl 
jacket, GR-A jacket and neoprene jacket containing zine oxide and 
magnesium oxide still showed some resistance after eight weeks of in- 
cubation. Only three of the elastomers, namely butyl and natural rub- 
ber jackets and natural rubber insulation, served as a source of energy 
for the anaerobic bacteria. 

The remaining general class of materials which provided a food source 
for the marine bacteria was the externally plasticized poly(vinyl chlo- 
ride) samples. In all poly(vinyl chloride) tests that showed high oxygen 
consumption the materials in question had been compounded with 
fatty-acid-type lubricants or external plasticizers that contained fatty 
acids. Three of the group which were studied and showed no attack 
were the pure resin and two internally plasticized semiflexible copoly- 
mers. These observations indicate that the plasticizer or lubricant and 
not the poly(vinyl chloride) itself was utilized as a source of carbon. 
With incubation at 5°C the rate of attack was much lower, but the 


samples plasticized with polyesters IE. and A, fatty acid compounds, were 


attacked considerably faster at both temperatures than the remaining 
poly(vinyl chloride) samples. Also, these samples were readily attacked 
by anaerobes during 16 weeks of incubation. 

In all tests conducted jute fiber was readily utilized, but it is rather 
difficult to compare rates of attack on jute with the various sheet mate- 
rials due to the great variation in material surface area made available 
to the bacteria. The performances of cellulose acetate at 5°C and sty- 
rene-polyester and polypropylene containing antioxidant B at 20° were 
close to the arbitrary 0.5 per cent per day consumption index, but can- 
not be reported as definitely showing attack. 

The majority of the anaerobic test data available was reported pre- 
viously by Snoke* but are also presented here in Table I] in order to 
include all the laboratory findings in a single document. In general, only 
materials that were attacked by aerobic organisms have tended to show 
attack by anaerobic sulfate-reducing bacteria, and usually this was to a 
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lesser degree and over a longer period of incubation. Since the rate of 
attack was usually very slow, the anaerobic tests were only run at 20°, 
where, it was assumed, the higher hydrogen sulfide production would be. 
There were three cases of doubtful performances in this group, natural 
rubber insulation, poly(vinyl chloride) DOP Source 1 plasticizer, and 
the polypropylene sample containing antioxidant B. 


VII. SUMMARY 


The following general statements are made regarding the relationship 
between the 5° and 20°C BOD tests run on 57 materials to date: 

1. Oxygen was consumed significantly during incubation at both 
temperatures for all the externally plasticized poly(vinyl chloride) 
samples and for all the elastomeric materials except neoprene jacket 
containing zinc oxide and magnesium oxide when incubated at 5°C. 
Jute was also utilized at both temperatures. 

2. In each of the above exceptions the microbial activity, measured 
by the oxygen consumed, was greater during the 20°C incubation tests. 
The ratio of the rates of attack at the two temperatures for the elasto- 
mers was generally 10 to 1, while it was 5 to 1 for the externally plas- 
ticized poly(vinyl chloride) samples and 2 to | for the untreated jute 
roving. 

3. No one figure could be used to describe adequately the rate of 
attack in any test, since the rates appeared to vary during the several 
intermediate incubation intervals. Sometimes the rate of attack seemed 


to be in direct proportion to the length of time of incubation, but quite 


often the rates either decreased or increased with an increase in incuba- 
tion time. The type of rate curve for the same material at the two tem- 
peratures was often dissimilar. 

$. One number was selected for each material representing an as- 
sumed average oxygen consumption per day. This figure, designated the 
Index of Oxygen Consumption, is presented as a means of simplifying 
the presentation of the data obtained from each test. 

5. The oxygen consumption with three materials — cellulose acetate 
when incubated at 5°C and silica-filled styrene-polyester and poly- 
propylene containing antioxidant B at 20°C — although very low, was 
somewhat higher than comparable control values. 

6. The following were not attacked at either temperature: 

a. Polyethylene 
b. Internally plasticized poly(vinyl chloride) copolymers and pure 
poly(vinyl chloride) resin 
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ce. Other thermoplastics with the possible exception of cellulose 
acetate at 5°C 

d. Epoxide casting compound 

e. Synthetic fibers acrylonitrile-vinyl chloride copolymer and 6-6 
polyamide (Source Ib). 

Although incubated for a longer time, the samples exposed to an- 
aerobic conditions showed less utilization by sulfate-reducing bacteria. 
Only three elastomers, six externally plasticized poly(vinyl chloride) 
samples, and jute showed noticeable attack following 16 weeks incuba- 
tion under anaerobic conditions at 20°C. 
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The Influence of Crystallization Conditions 
on Radiation Effects in Polyethylene — 
I. Crystallization from Dilute 
Solution and from the Melt 


By R. SALOVEY and A. KELLER 


(Manuscript received January 16, 1961) 


Samples of Marlex 50 crystallized from dilute solution (yielding single 
crystals) and in the bulk form were irradiated by high-energy electrons and 
the solubilities compared. The single-crystal specimens remained fully 
soluble after a dose of 20 megarads, which rendered the bulk two-thirds to 
three-quarters insoluble. Since the initial free-radical concentration was 
identical and the chemical changes induced were not widely different in 
the two kinds of specimens, it ts suggested that intramolecular crosslinks, 
occurring tn a larger number in the single crystals, are responsible for this 
behavior. At high doses the same solubility limit is approached by both 
spectmens, which ts ascribed to the possibility of chain cutting becoming 
increasingly ineffective in the single crystal. In the light of these findings, 
the limitations of the conventional treatment of radiation effects are dis- 
cussed and the possible significance of the present results for morphological 
problems is pointed out. . 

A number of further effects are reported in connection with melting be- 
havior and spin resonance spectra that are at variance with results in the 
literature. 


I. INTRODUCTION 


The effect of ionizing radiation on polyethylene has been extensively 
studied in the past'® but no investigation has been reported concerning 
the irradiation of polyethylene single crystals.’ We have examined the 
influence of morphology on the irradiation behavior of polyethylene. 
These studies are purely exploratory and many of the physical tools 
involved were used only in “key” tests. We did not intend to pursue 
every effect found in the course of these experiments as each might 
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merit a specific research program. Indeed, the present research was 
largely tailored so as to be accommodated by the limited tenure of one 
of us* at Bell Telephone Laboratories. For this reason, some experi- 
ments which appear to be obvious corollaries of the present work could 
not be performed, and a rather arbitrary line had to be drawn to termi- 
nate this exploratory work. Nevertheless, we feel that our findings in 
their present state convey sufficient information of value to have it 
published in its present form. 

The irradiation behavior of polyethylene is a difficult and controver- 
sial subject, which is being extensively and carefully studied in different 
laboratories.‘-'° We have attempted to minimize uncertainties by em- 
ploying a primarily comparative approach to the subject. In trying to 
establish differences between morphologically different polyethylenes, 
specimens under comparison received strictly identical treatments. 
Consequently, the many uncertainties that beset this field do not 
affect our principal experimental findings, which always refer to differ- 
ences in identically treated materials. 

This first paper will refer to single-crystal polyethylene prepared from 


dilute solution as compared with the melt-crystallized bulk specimen. 


We recall that such single crystals consist of uniform lamellae which are 
about 100 angstroms thick, the molecules approximately normal to the 
plane of the lamellae in a sharply and regularly folded configuration.* 
The bulk is usually considered as an aggregate of amorphous-crystalline 
fringed micelles. The recognition of single crystals and numerous morpho- 
logical observations support the idea of identifying similar structures in 
the bulk. The relation between single crystals and the bulk material is 
one of the key problems in present structural studies of polymers. It is 
difficult to ascertain whether the bulk has the same structure as the 
single crystals or represents a different kind of aggregation solely on the 
basis of direct structural examinations. Consequently, novel lines of 
approach are essential. Besides its intrinsic interest from the point of 
view of irradiation chemistry, we hope that this exploratory work will 
be relevant to the problem of polymer morphology. 


Il. EXPERIMENTAL DETAILS AND RESULTS 


2.1 Specimens 


The material studied was the linear polyethylene Marlex 50. Two 


kinds of preparations were used. In one case, the polymer was dissolved 


* A. Keller 
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in boiling xylene and crystallized isothermally at lower temperatures. 
Concentrations ranged from 0.02 to 0.05 per cent and crystallizations 
were conducted at 70° or 85°C. A suspension of single crystals formed 
which could readily be verified microscopically. The suspended material] 
was collected by filtering and was vacuum-dried. Low-angle X-ray 
scattering’ revealed that the fold lengths were 110 and 130 angstroms 
for crystallization temperatures of 70° and 85°C respectively. The second 
kind of material consisted of melt-crystallized flakes. Preceding irradia- 
tion, specimens were placed in Pyrex tubes, pumped in high vacuum 
overnight, and sealed. 


2.2 Irradiation 


Irradiations were conducted within vacuum-sealed tubes in a beam 
of 1 million electrou-volt electrons using a High Voltage Engineering Co. 
Van de Graaff accelerator. In order to minimize heating, samples were 
placed on a plate cooled by circulating cold water. The dose rate was 
determined by measuring the attenuation in current, with a Faraday 
cage, on interposing a known thickness of polyethylene in the electron 
beam. Moreover, current absorption by the tube was measured and 
accounted for. Dose rates of about 5 megarads per minute were ordinarily 
employed. The dose of 20 megarads was used for most comparative 
purposes for reasons to become apparent below, but a few experiments 
with other doses ranging from 2 to 128 megarads will also be quoted. 
Unless specifically stated, specimens were fused at 150°C while still 
sealed off after irradiation, which should lead to maximal changes in 
properties owing to the activation of the free radicals formed.®7 


2.3 Solubility 


Solubility determinations represent the principal measurements in 


this work. They were carried out by Soxhlet extraction with refluxing 


xylene, a treatment which fully extracts the unirradiated starting mate- 
rial. 

Bulk and single-crystal specimens showed very marked differences in 
their irradiation characteristics. When irradiated by our standard 20- 
megarad dose the bulk became much less soluble, leaving an insoluble 
gel fraction of 65-75 per cent. The single-crystal preparation irradiated 
with the same 20-megarad dose remained fully soluble, leaving no gel 
in the extraction thimble. Most of the work which followed was aimed 
at evaluating this remarkable difference. 

An investigation of the full dose-solubility characteristics could not 
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be fully accommodated within the present experimental program. Never- 
theless, we can quote a few data obtained with other doses. The radia- 
tion-solubility characteristics of the bulk have been known from some 
earlier work by one of us.* Even if conditions were not rigorously identi- 
cal with those in the present 20-megarad experiments, they are repre- 
sentative of the over-all trend. Accordingly, the gel point is roughly 2 
megarads; 10 megarads already gives a gel content of 60 per cent, rising 
to 70 per cent at 20 megarads and to 80 per cent at 40 megarads. In the 
case of the single crystals, a gel content of 26.5 per cent was obtained 
with a dose of 48 megarads. A dose of 85 megarads yielded 76 per cent, 
and one of 128 megarads, 86 per cent gel. The last two irradiations were 
carried out under somewhat altered operational conditions of the gen- 
erator, which necessitated leaving the specimens in air. Even if not 
strictly comparable with the “standard” experiment, these figures 
certainly bear out the fact that at higher doses a limiting value compar- 
able with that of the bulk is reached. 

To sum up the over-all trend: the bulk shows the first signs of in- 
solubility at less than one-tenth of the dose needed to produce the first 
gel in the single crystals. It becomes two-thirds to three-quarters in- 
soluble at a dose where the solubility of the single crystals is still un- 
affected, but at sufficiently high doses both specimens converge to 
approximately the same limiting solubility. 

When a single-crystal preparation was fused prior to irradiation, the 
20-megarad dose resulted in 71.4 per cent gel; consequently, the sample 
behaved as the bulk material did. This means that the change in solu- 
bility is due to the difference in the morphology alone and not to some 
change introduced in the course of specimen preparation. No obvious 
change in the X-ray pattern could be observed on irradiation within the 
dose range studied in either type of specimen. Further, the irradiated 
and still soluble single crystal (irradiated to our standard 20-megarad 


dose), dissolved thereafter, showed no significant difference in its subse- 


quent crystallization behavior from that of similarly treated unirradiated 


material. 


2.4 Viscosity Measurement 


The intrinsic viscosities of two single-crystal preparations, one before 
and one after irradiation with 20 megarads, were compared in dilute 
xylene solution at 100°C. The results were 1.42 deciliters per gram for 


* R. Salovey. 
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the unirradiated and 1.38 for the irradiated specimens, the difference 
considered to be within experimental error 


2.5 Melting Behavior 


It was found that the melting behavior was largely dominated by 
rate effects unless special precautions were taken to heat the samples 


slowly. We were not aware of this in the first experiments, which con- 
sisted of placing small fragments of irradiated and unirradiated poly- 
ethylene side by side on a coverglass lying on a temperature gradient 


bar. The coverglass was moved gradually along the bar towards higher 
temperatures. Since the unirradiated single-crystal preparation goes 
through a series of pronounced premelting and recrystallization effects, 
specimens were first fused once so that the initial morphology was 
identical in all of them. The irradiated bulk material retained its shape 
even after fusion, so that the melting point could not be readily deduced 
from the consistency alone. The irradiated single-crystal specimen, 
however, was found to melt 4 to 5°C lower (at 130 to 132°C) than the 
corresponding unirradiated samples (135 to 136°C) in this test. It is 
worth noting that the consistency above the melting point proved to be 
uniquely related to the solubility. Irradiated specimens which did not 
flow when heated to 140 to 150°C invariably proved to be largely in- 
soluble. Thus, the solubility results could always be predicted from the 
appearance of the sample when fused in the course of our standard 
treatment after irradiation. 

More accurate melting-point measurements were determined by a 
precision dilatometric technique set up by C. J. Aloisio and 8. Matsuoka 
of Bell Telephone Laboratories. Heating rates were very slow, the entire 
procedure taking about seven days. Just below the melting point, the 
temperature was raised in very small increments, allowing the materials 
to recrystallize. The unirradiated single crystals showed pronounced 
recrystallization effects, i.e., remelting and recrystallization to higher 
densities, in contrast to the smaller effects evidenced by the irradiated 
single crystals (which had already been fused following irradiation). 
The melting point itself was the same for all four irradiated and un- 
irradiated specimens: 135.5 to 136°C. Accordingly, the previously ob- 
served melting-point depression in the irradiated single crystals was due 
to the relatively rapid rate of heating. We did not observe the melting- 
point depression of 4 to 5°C reported in the literature for bulk specimens 
irradiated with comparable doses.* After fusion, the dilatometer was 
cooled to 125°C, and the samples recrystallized at this temperature. 
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Both irradiated samples attained the previously observed specific 
volumes, in contrast to the unirradiated samples, which reached higher 
densities when cooled at 125°C from the melt. 


2.6 Electron Spin Resonance 


Electron spin resonance measurements were made with a Varian EPR 
spectrometer in order to assess the free-radical concentration resulting 
from irradiation. We compared a single-crystal preparation and a bulk 
specimen irradiated with 20 megarads. Irradiation was conducted at low 
temperatures by placing the specimen tubes on a block of solid carbon 
dioxide wet with acetone. In this case the samples were not fused imme- 
diately afterwards, as was done in the other experiments, but the tubes 
were dropped into liquid nitrogen. The specimens were subsequently 
warmed for successively increasing lengths of time, and spin resonance 
measurements were made (with the sample in an unirradiated part of 
the tube). 

Measurements at liquid nitrogen temperatures revealed broad absorp- 
tion peaks in both specimens. These were identical in position, shape, 
and intensity for single-crystal and bulk specimens. On exposure to room 
temperature the strength of the absorption decayed, while a multiple- 
line pattern emerged from the band. The decay was much faster for the 
single crystals, where the peak decreased by a factor of eight after two 
hours and by a further 40 per cent after a total time of 16 hours. The 
corresponding decrease in the bulk was only by a factor of two, taking 
place during the first two hours, and resulting in a seven-fold difference 
between the two specimens. On heating to 74°C the absorption in the 
bulk decreased by 60 per cent after half an hour, and a further decrease 
to one-third of this value occurred after 20 minutes at 120°C. The single- 
crystal spectrum remained essentially unchanged by this treatment. 
Thus, the two peaks became roughly comparable in the two specimens. 
The peaks remained unaltered in both specimens on fusion at 152°C 
for half an hour, but a further decrease by a factor of two occurred in the 
single crystals when they were kept fused overnight. The spectra per- 
sisted even when the tubes were opened. 

The initial decay could also be seen visually. Both specimens were 
brown after irradiation at low temperatures, but the color faded rapidly 
on warming to room temperature. 

We do not attempt any specific assignment of the band, particularly 


as in some detail our findings differ from those reported in literature.’: 


For the present purpose it suffices to conclude that the free radicals 
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responsible for the absorption were present in comparable concentration 
in both kinds of irradiated specimens, but showed different decay pat- 


terns in the two. 


2.7 Infrared Spectra 


We examined infrared absorption of single crystals and bulk specimens 
and corresponding samples irradiated with 20 megarads. There were only 
slight differences between the original bulk and single-crystal specimens. 


On irradiation the vinyl peaks (terminal unsaturation) at 910 and 990 
em! decreased in both specimens but to a lesser extent in the single 
crystals, where it decreased by a factor of two, in contrast to the bulk, 
where the decrease was by a factor of seven (as referred to the weak 
methyl band at 1370 em~!). There was a pronounced increase of the 
transvinylene band (internal unsaturation) at 965 em~!, which was prac- 
tically identical in the two samples (in the single crystals the increase was 
about 20 per cent larger). 


2.8 Hydrogen Evolution 


A bulk and a single-crystal specimen irradiated with 20 megarads 
were analyzed for free hydrogen with a mass spectrometer. Single crystals 
gave off 4.19 K 10~ mole hydrogen per mole of CH, and the bulk gave 
off 8.17 K 10-4. After this analysis the specimens were fused and ana- 
lyzed again for free hydrogen which might have been originally occluded. 
Only a slight increase was found, the final figures being 4.40 « 10~ 
mole per mole of CHe. in the single crystals and 8.28 & 10-* mole per 
mole of CHe in the bulk. The single-crystal preparation gave off notice- 
able amounts of larger fragments roughly comparable in volume to the 
hydrogen evolved. Thus, in this measurement the single crystals gave 
off about one-half the amount of hydrogen as did the bulk. 


Ill. DISCUSSION 


It was initially apparent from the solubility measurements that the 
samples erystallized from the melt and from dilute solution responded 
differently to irradiation. Here we shall discuss possible causes of this 
striking difference. The decrease in solubility on irradiation is usually 
associated with the formation of crosslinks leading to an infinite network. 
In fact, solubility is often used as a measure of crosslinking on the basis 
of Flory’s theory of network formation.'*:"' Accordingly, the first signs 
of insolubility are observed when an infinite network first forms. The 
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dose where this occurs is referred to as the gel point because insoluble 
material appears in the gel consistency. With increasing doses the rela- 
tive amount of gel increases and rapidly attains a high value. For a given 
crosslink concentration, the gel point is also a function of the molecular 
weight; shorter molecules lead to enhanced solubility. Moreover, the 
onset of gelation is somewhat retarded by the incidence of scission, which 
accompanies crosslinking in the irradiation of polyethylene.!” 

Our observations first raise the question whether or not the radiation 
has an intrinsically much smaller effect on the single crystals than on the 
bulk. For this, as for most of the later arguments, we concentrate on the 
specimens which received 20 megarads, since in this case the differences 
were the most clear cut; the single crystals remained soluble, while 
about 70 per cent of the bulk became insoluble. Certain references in the 
literature state that irradiation should have a much smaller effect on the 
crystalline than on the amorphous portions.®’ Accordingly, single 
crystals should be less affected. However, L. Mandelkern* has found 
that the effects of irradiation on the solubility of crystalline and amor- 
phous polymer depend on the radiation dose. In any case, the crystal- 
linity content of the bulk specimen is not sufficiently different from that 
of the single crystals (it is about 90 per cent crystalline) to account for 
our effect, quite apart from the ambiguities encountered in definitions 
of the amount of crystallinity. However, our spin resonance results 
indicate that the primary effect of the radiation is comparable in bulk 
and single-crystal specimens, the concentration of the alkyl radicals 
being about the same (also indicated by the apparent discoloration). It 
is true that, from the one comparison pursued, the hydrogen evolution 
in the single-crystal specimens is only half of that found in the bulk for a 
dose of 20 megarads. If this means that the radiation is only half as 
effective in introducing chemical changes in the single crystals, the single 
crystals should behave like the bulk irradiated to 10 megarads. But a 
dose of 10 megarads still renders the bulk material about 60 per cent 


insoluble. Consequently, a factor of two in the hydrogen evolution could 


not account for our observations. Thus, the possibility that the effect 
of the radiation on the single crystals is sufficiently less to account for 
the observed increase in solubility can be ruled out. 

Another alternative is that the scission-to-crosslinking ratio is higher 
in the single crystal, while the total effect of the radiation (scission plus 
crosslinking) remains comparable. 

Infrared results indicate that the terminal vinyl concentration is 


* Private communication. 
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higher in the irradiated single crystal than in the corresponding bulk. If 
scission occurs by disproportionation leading to vinyl groups, this, at 
first sight, may seem to support the suggestion of increased scission. 
However, the vinyl concentration is invariably found to go down on 
irradiation in spite of the assumed scission by disproportionation.® So 
does it also in both of our specimens, but less in the single crystals. Since 
the basic reason for the decrease in the vinyl concentration is obscure, 


differences in the relative amount of decrease cannot be safely invoked 


as evidence for differences in scission in the present argument. End- 
linking has been suggested to explain the terminal vinyl decay.” This 
would increase the size of the molecule and facilitate gelation, which 
would be in line with the more rapid lowering of the solubility in the bulk. 
However, whether this speculative factor could account for the large 
differences in the two specimens remains questionable. 

The possibility of scission leading to methyl end groups through 
hydrogen uptake is not usually considered. We could not observe any 
increase in methyl concentration from the infrared spectra, but owing 
to the relative weakness of the band this is not conclusive. As far as one 
can surmise, such a mechanism could consume part of the hydrogen 
evolved and explain why less hydrogen forms on irradiation of single 
crystals. However, this hydrogen deSciency could result from a recom- 
bination of the radical and hydrogen, which might occur more readily 
in the better-ordered single-crystal system. Thus, while the infrared 
evidence raises some interesting possibilities, it remains inconclusive as 
regards scission. Neither is there evidence for an increase in scission from 
other properties, such as viscosity or melting point. In fact, there is no 
evidence for scission at all other than the existence of a limiting solu- 
bility at high doses. Since this limiting solubility is comparable for both 
samples, it is unlikely that the amount of scission should be much differ- 
ent at the smaller doses. 

The theory of network formation on which all arguments in this field 
are based assumes that crosslinks are randomly distributed between 
different molecules, intramolecular crosslinks being ignored. If in two 
specimens the total number of crosslinks were the same, but in one of 
them part of the crosslinks were intramolecular, the latter one would 
evidence enhanced solubility. In other words, part of the crosslinks would 
not contribute to the formation of an infinite network, hence they would 
not contribute to the decrease of solubility. The existence of some intra- 
molecular crosslinks (“‘ringlinks’’?) has been suggested" to account for 
the lack of material balance on irradiating bulk specimens. Since single 
erystals consist of folded chains, intramolecular crosslinking would be 
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expected to be appreciable. In fact, it was with this expectation that the 


present investigation was undertaken. 


Besides the enhanced probability of intramolecular crosslinking in a 
folded structure, the probability of multiple intermolecular linking 
between a finite number of molecules in close proximity is also enhanced, 
in view of the extensive contacts between ribbons of folded molecules. 
Such crosslinks will not contribute to the formation of an infinite net- 
work, hence will be ineffective in lowering the solubility. The presence of 
these crosslinks should, however, show up in the viscosity. The lack of a 
significant change in our viscosity figures might be due to a counter- 
balancing of two effects: a reduction due to internal crosslinks making 
the molecule more compact and the increase of the molecular weight. 

Attractive and probable as the explanation based on intramolecular 
crosslinking may sound, no final confirmation could be brought to bear 
on the issue within the range of the present work. The salient point is 
that there is no direct chemical method for the assessment of cross- 
links. Usually the solubility criterion is used, and this is clearly restricted 
to intermolecular crosslinks. In fact, our experiments demonstrate the 
limitations of this criterion. The situation is identical with changes in the 
elastic modulus, which are also used as a measure of crosslinking,'? since 
this is also based on the formation of infinite networks through inter- 
molecular crosslinks. The simultaneous appearance of insolubility in 
the irradiated material and that of rubbery consistency on fusion are 
due to this reason. This leaves us with the possibility of studying internal 
configuration in greater detail, which would require extensive and careful 
work on solution properties beyond the scope of the present investigation. 

The fact that the limiting gel content is comparable for both samples 
in spite of the significant initial difference needs some comments. It is 
customary to characterize the scission/crosslinking ratio (p/q) by a 
plot of s + +s versus |r, where s is the soluble fraction and r is the 
dose: then the intercept along the ordinate gives p/q.4 Since this intercept 
will be determined by solubility figures at high doses, different curves 
tending to the same limiting value will give the same p/q. In such cases 
the same pq is assigned to the corresponding samples, and the differences 
at lower doses are attributed to differences in the type of initial molecular 
weight distributions. (The molecular weight distributions will tend to 
become identical — random — in the course of scission.) In our ease, how- 
ever, the molecular weight distribution is identical in the two samples. 
The fact that after fusion the single-crystal preparation showed the same 
radiation characteristics as the bulk proves that no molecular weight 
or distribution change sufficient to modify the radiation behavior could 
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have occurred during the preparation of the single crystals. This means 


that the practically identical p/q values obtained from the limiting 
solubility cannot characterize both samples; p/q must vary with the 
dose at least in one, if not both, specimens. 

The Charlesby-Pinner* treatment is based on the assumption that 
both crosslinking and scission are proportional to the dose received. We 
now suggest that this assumption could still be valid for the chemical 
changes induced, but that the effectiveness of these in altering the solu- 
bility might vary. We have seen that the initial solubility of the single- 
crystal specimen could be accounted for by assuming that part of the 
crosslinking is ineffective because of intramolecular (or multiple inter- 
molecular) crosslinks leading to a large effective p/q. With large doses, 
the effective p/q decreases to that corresponding to the bulk. Plausibly, 
this could be due to a decrease in the effectiveness of scission while the 
true p/q may remain unaltered. If scission occurred within a loop formed 
by an intramolecular crosslink (or multiple intermolecular link) the 
number of molecules would not increase; hence the cutting would be 
ineffective as regards lowering the solubility. The chance for scission to 
become ineffective in this way increases with crosslinking, hence with 
the dose. In the light of this picture, it appears even less probable that 
scission should be unusually high at low doses in the single-crystal 
preparation, which strengthens our contention that intramolecular cross- 
linking rather than unusually high scission should be responsible for the 
high solubility in the 20-megarad region. 

If the explanation based on intramolecular crosslinking proves to be 
true it would imply that a given molecular segment within the crystal 
in the bulk is in an environment of segments predominantly belonging to 
different molecules, in contrast to single crystals, where it would be 
partly in the vicinity of other parts of the same molecule. This would 
suggest that chain folding would be less frequent in the bulk, if present 
at all. Thus, it appears that this newly found radiation effect might 
contribute substantially to the present controversy as to whether or not 
folded molecules found in single crystals are also representative of the 
bulk. However, it will be shown in the following paper" that such conclu- 
sions would be premature. It will be demonstrated that the effect of the 
morphology intervenes in a more subtle way and that one cannot exclude 
folded structures on the present evidence. 
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The Influence of Crystallization Conditions 
on Radiation Effects in Polyethylene 
II. Crystallization from 


Concentrated Solutions 


By R. SALOVEY and A. KELLER 


(Manuscript received January 24, 1961 


Samples of Marlex 50 were crystallized from solution in the concentration 
range of 0.1 to 2 per cent and were irradiated with a dose of 20-megarad 
electrons. If crystallized at 70°C the materials were largely soluble, but if 
crystallization was conducted at 85°C the samples became two-thirds in- 
soluble at from 0.4 to 0.6 per cent concentration upwards. Morphological 
examination of samples crystallized from 1 per cent solution revealed that 
the mode of crystallization was lamellar in all cases, but in the case of 70°C 
crystallization the lamellae were largely separated while in the 85°C case 
they were grown together in a compacted mass. After the latter was broken 
up by ultrasonic treatment this sample stayed fully soluble after 20-megarad 
electron irradiation. This finding indicates that the crosslinks are formed 
predominantly at the lamellar interfaces tying the lamellae together when 
these are in close contact. This might imply that the radiation is most effec- 
tive at the sites where the chains fold. The present exploratory work shows 
the importance of the mode of packing of the lamellae, and indicates that 
this might be an important factor in distinguishing between the different 
morphologies observed in general. 


I, INTRODUCTION 


It was shown in the preceding paper! that polyethylenes crystallized 
from the melt and from dilute solution were affected differently by elee- 
tron irradiation, the former becoming insoluble at much lower doses than 
the latter. In addition to their inherent interest for radiation studies, 
these findings should also be of value for morphological investigations, as 
the morphological state of the specimen may be reflected in the effect 
of the radiation on solubility. As was concluded in the preceding paper, 
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the different behavior of the bulk and dilute solution crystallized speci- 
mens indicates different morphologies. In particular, the higher solubility 
of the irradiated solution crystallized specimens is readily explicable in 
terms of intramolecular crosslinks forming within the folded chains of 
such single-crystal preparations. Accordingly, the lower proportion or 
even absence of such intramolecular linkages in the bulk would indicate 
that the molecules are not folded in such specimens. This appears to 
support the view that we are dealing with two distinct states of matter 
in the two kinds of specimens, a point which is nowadays widely de- 
bated.23 

According to recent theoretical work by Lauritzen and Hoffman,! 
crystallization from dilute solution should oceur via chain folding and 
that from the melt through the traditional bundle-like aggregation, a 
transition occurring between the two as the concentration of the solution 
is increased. Although these authors have modified their view since and 
consider the possibility of a folded crystallization in all cases,* we ac- 
cepted their first suggestion as a working hypothesis at the time, particu- 
larly in view of the large differences found in the two specimens in our 
radiation work. In order to explore the possible transition between the 
two kinds of behavior we examined polymers crystallized from more 
concentrated solutions. This work again is of a purely exploratory nature 
subject to the same limitations mentioned in the introduction of the 


preceding paper. 


Il, EFFECT OF CONCENTRATION AND TEMPERATURE OF CRYSTALLIZATION 
ON THE SOLUBILITY 


2.1 Methods 


Our material was again the linear polyethylene Marlex 50. Specimens 
were crystallized at either 70° or 85°C from xylene solutions of varying 
concentrations. Crystallization at 70°C was very rapid for the con- 
centrated solutions, turbidity setting in nearly immediately on lowering 
the temperature of the solution to 70°C. All samples gave discrete low- 
angle reflections corresponding to spacings of 110 and 130 angstroms in 
the samples crystallized at 70 and 85°C respectively. Irradiation experi- 
ments were carried out under the same circumstances as in the preceding 
paper.' Only one dose size (that of 20 megarads) was used, since it was 
with this dose that the most pronounced differences between melt and 


dilute solution crystallized specimens were found. Solubility changes were 


* Private communication. 
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TABLE I—SouuBILiry OF POLYETHYLENE (CRYSTALLIZED AT VARIOUS 
CONCENTRATIONS AND TEMPERATURES) AFTER [IRRADIATION TO 
20 MEGARADS 


Gel fraction 
Concentration 


Crystallized at 70°C Crystallized at 85°C 


0.1% 

0.3 7.6% 
0.4 

0.6 

0.8 


2.4 
7 


2.0* 


* There was an undissolved residue left. The actual concentration must have 
been lower than 2 per cent. 


determined by using the same Soxhlet extraction method described 
previously.! All irradiations reported here were carried out under strictly 
identical conditions, samples under comparison having been irradiated 


simultaneously. 


2.2 Solubility Results 


The results are listed in Table I. As can be seen, the solubility re- 
mained practically unaffected in the whole accessible concentration range 
for specimens crystallized at 70°C, giving a very small gel content even 
for the most concentrated solution (the relatively high figure at 0.3 per 
cent is undoubtedly anomalous, but this does not affect the general 
conclusion). Samples crystallized at 85°C, however, showed a remarkable 
transition. For this reason the 85° series was closely examined. Here, the 
gel content was insignificant up to a concentration of 0.4 per cent, the 
material remaining essentially fully soluble. Beyond this stage however, 
the gel content rose suddenly to a value comparable with bulk crystal- 
lized samples irradiated to the same dose. 

According to these results, all the specimens crystallized at 70°C 
behaved in a similar manner independent of concentration. Since the 
single-crystal nature of specimens formed at low concentrations is estab- 
lished, there is no reason to expect any departure from this at the higher 
concentrations on the basis of the radiation results (although the per- 
sistence of single crystals to such high concentrations may appear 
surprising). The transition from full solubility to two-thirds insolubility 
with increasing concentration in the case of 85°C crystallization may 
appear to be consistent with the expected transition from single crystal 
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Fig. 1 Typical polyethylene dendrites formed at 70°C from a 1 per cent 
xylene solution, photographed in a mixture of xylene and acetone. (Phase-con 
trast microscopy ; magnification of original photograph 600% 


to fringed micelle; the dependence on the temperature of crystallization, 


however, would remain unexplained. * 


2.3 Microscopic Examinations 


Specimens crystallized from 1 per cent solution at 70° and 85°C were 
examined by phase-contrast microscopy. Figs. 1 and 2 were taken of a 
suspension formed at 70°C. Fig. 1 clearly represents dendritic single 


crystals; Fig. 2 needs some further comments. In another investigation 


by one of us in collaboration with Mitsuhashi® it was established that 
the consecutive terraces of a polyethylene crystal need not be grown 
together but can be spatially separated, the whole crystal resembling 
more a loose helicoid (tighter at the center than at the edges) than a 


* One could invoke some of the implications of the kinetic approach to crystal 
lization.* Accordingly, in concentrated solutions nucleation via inter- and intra 
molecular processes might compete. At high temperatures bundle-like erystalliza 
tion would be favored in accordance with the considerations of Lauritzen and 
Hoffman. At low temperatures (high supercoolings) the nucleation rate might be 
so high that several intramolecular nuclei might form within the same molecule 
(a possibility predicted by Frank and Tosi®) even before a bundle from different 
molecules could develop by the aggregation of different chains. This would result 
in further crystal growth occurring by an aggregation of tiny chain folded erystals. 
Frank and Tosi estimate a crystallization temperature of 70°C in xylene where 
such a process could set in. Thus, as it stands this could be in line with our ob 
servations. Further observations which followed, however, indicate that other 
factors are at play which make this proposed explanation questionable 
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Fig. 2 Multilayer dendrites in the same preparation as in Fig. 1. (Phase 
£ ; pre} 


contrast microscopy; magnification of original photograph 600X.) 


solid spiral pyramid. This could be directly confirmed by observing the 
crystals turning in the liquid, when the individual layers could be seen 
to be separated and diverging outwards from the center. The more 
complicated structure in Fig. 2 is essentially such a multilayer crystal 
in an unusual projection. When such crystals are dried down, this three- 
dimensional structure collapses, leading to obscured areas in the interior 
of the crystal. It follows that such confused central areas usually ob- 
served in dried-down crystals need not be indicative of a departure from 
the chain folded lamellar structure. The crystals in Figs. | and 2 were 
in suspension when photographed to avoid this obscuring effect. We 
conclude that the specimen crystallized at 70°C has the usual lamellar 
single crystal habit throughout. 

Fig. 3 shows a typical example of aggregates formed at 85°C. As seen, 
they are compact and nearly circular (although reminiscent of spheru- 
lites, they do not show a clear Maltese cross extinction pattern between 


crossed polaroids). In view of the marked difference in solubility after 
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Fig. 3 Typical compact crystal formations of polyethylene formed at 85°C 
from 1 per cent xylene solution; the crystals are in their mother liquor. (Phase 
contrast microscopy; magnification of original photograph 1000 X .) 


irradiation, we anticipated a difference in morphology between prepara- 
tions formed at 70°C and at 85°C. However, the specimen crystallized 
at 85°C also proved to be lamellar, the distinct fibrous structure corre- 
sponding to lamellae seen edge-on. The observation of loose lamellae 
protruding beyond the edges of the more compact mass also suggests 
that the aggregates in Fig. 3 consist of closely packed lamellae. Thus 
the morphology may not be basically dissimilar in the two specimens. 
Both would consist essentially of chain-folded lamellae; in the 70°C 


crystallization the lamellae within a crystal are separated by the liquid 
in the sense described above, while in the 85°C erystallization they are 


compacted in a solid mass. It would follow, therefore, that the difference 
in the radiation behavior may be connected with the mode of packing of 
the lamellae. We attempted,to separate these lamellae in samples crystal- 
lized at 85°C to see whether the high solubility of irradiated specimens 
formed at 70°C could be obtained. 
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2.4 Disintegration Experiments 


The disintegration of aggregates formed at 85°C (Fig. 3) was effected 


by means of ultrasonic radiation. Suspensions in xylene were exposed 


to radiation from an ultrasonic generator operating at 40 ke and 200 
watts. The first trials showed that disintegration of the aggregates 
occurred and samples were then subjected to treatments of increasing 
duration.* 


2.5 Microscopic Examination 


Increasing extents of disintegration are indicated in Fig. 4. It will 
be seen that the compact masses are broken up into smaller units con- 
sisting of fewer fibrous entities (lamellae seen edge-on). The full mor- 
phological implications will not be discussed here. However, we suggest 
that the layers in Fig. 4 are probably multilayers themselves (not the 
basic 100-angstrom ones), and point out the frequent occurrence of 
pyramidal fragments consistent with other observations in this field. * 
*.10 Besides the prominent structure in the photographs, a finer back- 
ground structure is also visible which increases with the duration of the 
ultrasonic treatment. This is partly a lamellar strueture of the basic 
100-angstrom thickness and partly a ribbon-like fibrillar structure which 
may have its origin in broken-up lamellar units. 


2.6 Solubility of Irradiated Disintegrated Specimens 


Suspensions of polyethylene prepared by crystallizing a | per cent 
solution at 85°C and treated ultrasonically for 4 and 8 hours were filtered 
and prepared for irradiation with a 20-megarad dose in the usual manner. 
The irradiation of four samples was carried out simultaneously: the two 
just mentioned, a third sample which was a portion of the above prepara- 
tion that received the eight-hour ultrasonic treatment but was fused 
subsequently, and a fourth sample that was crystallized under the same 
conditions but was not treated ultrasonically. When the samples were 
heated to 150°C after irradiation, while still in vacuo as part of the usual 
procedure,'! marked differences could be observed in the melting be- 
havior. The untreated and the ultrasonically treated but subsequently 
fused sample both retained their shape and showed no outward signs of 
melting, while the two remaining ultrasonically treated samples flowed 


* The geometry of the set-up available for these tests permitted only a very in 
efficient utilization of energy. Consequently the times to be quoted are unusually 
Te in comparison with results since obtained with other generators by one of us 

. Keller). Nevertheless, they give a comparative measure of the treatments 
ne <d by the different samples 
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Fig. 4 The same preparation as in Fig. 3, treated ultrasonically for (a) one 
and one-half hours, (b) four hours, and (c) eight hours. (Phase-contrast micros 
copy; magnification of original photograph 1060X 


as unirradiated polyethylene does. As described in the preceding paper, 


the absence of clear melting was indicative of intermolecular crosslinks 
and was invariably associated with appreciable insolubility. In fact, 
both ultrasonically treated and unfused specimens were fully soluble, the 
untreated sample giving a gel fraction of 60.9 per cent, and the sample 
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which received an ultrasonic treatment of eight hours and was subse- 
quently fused giving 27.8 per cent gel. 


Ill. DISCUSSION OF RESULTS 


The complete solubility of the two ultrasonically treated specimens 
indicated that the behavior characteristic of single-crystal preparations 
had been restored. If the differences between specimens showing complete 
solubility, on the one hand, and large insolubility, on the other, after 
a 20-megarad dose were due to inherently different molecular textures, 
this situation could not be altered by ultrasonic treatment. Clearly, 
bundle-like structures could not be changed into folded ones in this way. 
However, morphological observations suggest that both kinds of samples 
consist essentially of lamellae which are packed differently. Accordingly, 
if the lamellae are closely packed in the sample crystallized at 85°C 
crosslinking is more likely to occur between them. Such interlamellar 
crosslinks would be necessarily intermolecular and would lead to a 
decrease in solubility in agreement with our experimental results. This 
raises a further problem. Only a small fraction of the molecule (2 to 4 
per cent, the part in the folds) is in the lamellar interfaces. If crosslinking 
could occur with equal probability anywhere along the chain, there would 
only be an insignificant chance for it to occur at the interface linking 
two lamellae. On this basis, interlamellar links could not have the signifi- 
cant effect our results imply. It appears therefore that the folds may be 
“sensitive points” where crosslinking may occur with greater frequency 
than elsewhere. 

This situation is complicated by the fact that the gel fraction of the 
sample which was treated ultrasonically for eight hours and subsequently 
fused did not regain the value of 60.9 per cent given under exactly the 
same conditions by the untreated control (but only yielded about half 
of this quantity). Probably an appreciable amount of chain-cutting was 


caused by the ultrasonic radiation. Then the increase of the solubility 


of the two unfused ultrasonically treated specimens cannot be interpreted 
in terms of physical changes alone. A significant part of it must be due 
to the degradation of the molecule. The amount of chain-cutting is 
expected to depend on the time of ultrasonic treatment; the sample 
treated for four hours was still fully soluble on irradiation and the mini- 
mum treatment required might well be still shorter. This means that our 
treatment and consequent degradation were unnecessarily drastic. There- 
fore, perhaps a more complete reversibility of the gel content on fusion 
would still be compatible with full solubility of the specimen prior to 
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fusion. Fuller clarification of this point could not be accommodated in 
the present program. The present results have established the fact that 
the mode of aggregation of the lamellae can have a profound effect on 
the irradiation behavior. In view of scission during the ultrasonic treat- 
ment, it cannot be stated with certainty that the full difference in irra- 


diation behavior between the two specimens under question can be 


accounted for by the mode of aggregation of the lamellae. 

We have tacitly implied that the contact between layers is of the van 
der Waals type only. It is likely that the better compacting of the units 
in the specimen crystallized at 85°C is due to a few molecules running 
through two or more layers before folding back, or that some folds 
might interpenetrate. This situation is more likely to arise with con- 
centrated systems. If this were the case, these joining molecules would 
have to be cut for the lamellae to be separated, which would mean that 
some of the molecular degradation may be necessarily linked with the 
physical separation of the layers. 

We have seen that the packing of the layers is strongly influenced by 
the temperature of crystallization, layers grown at higher temperatures 
being more closely stuck or linked together. At this stage, we can offer 
no explanation for this behavior. 


IV. SOME GENERAL CONSIDERATIONS 


In Section III, discussion was restricted to the interpretation of 
differences between two kinds of solution crystallized specimens, one 
crystallized at 70°C and the other at 85°C from 1 per cent xylene solu- 
tion. The question arises as to whether the explanation cited could be 
extended to account for the differences between solution-crystallized 
specimens (which retained their full solubility after a 20-megarad dose) 
and bulk specimens (which became two-thirds to three-quarters insoluble 
after the same dose). In the light of present findings it can certainly be 
stated that this difference need not be associated with a basic difference 
in molecular texture, i.e., either with the presence of folded molecules 
or of bundles. In fact, in the case of specimens crystallized from 1 per 
cent solution at 85°C, no basic departure from single-crystal texture was 
found other than more compact packing of lamellae, perhaps arising 
from some attachment of these units. It may not be unreasonable, there- 
fore, to consider the possibility of the same effect being responsible for 
bulk behavior. This would imply that the bulk might consist of chain 
folded lamellae. Indeed, we would expect closer packing and stronger 
connectedness of such lamellae in the bulk. Such a picture of the bulk 
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would not contradict the results of the present radiation studies. On the 
contrary, it would be supported by analogy with the behavior of speci- 
mens crystallized from concentrated solutions as well as by existing 


evidence of lamellar structures in the bulk.'':? In view of the complexity 


of the subject, it is not intended to imply that the present evidence is in 
any way conclusive. (lor other views see, for example, Chiang and 
Ilory.*) 

In conclusion, the present exploratory work has drawn attention to 
limitations in the usual treatments of the effects of irradiation which 
consider random intermolecular crosslinking and random scission only. 
Moreover, it contributes to current morphological studies by indicating 
the importance of packing of crystal lamellae. Finally, radiation effects 
are shown to be sensitive indicators of the morphological state of the 
polymer, and thus should take their place among other structural tools. 
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Compression, Filtering, and 


Signal-to-Noise Ratio in a 
Pulse-Modulated System 


By kK. A. MARCATILI 


(Manuscript received April 19, 1961) 


Some aspects of the transmission of Gaussian pulses in a frequency-divi- 
ston multiplex system have been calculated. It is assumed that each channel 
transmitter includes an amplifier, which may be nonlinear, with input and 
output filters, and that cach channel receiver includes a linear amplifier 
with a thermal noise source at its input plus input and output filters. The 
influence that (a) transmitting amplifier compression, (b) maximally flat 
channel-dropping filters of first or second order, and (ec) distribution of 
filtering in the system have on the signal-to-noise ratio, time crosstalk, and 
frequency crosstalk is calculated. 


I. INTRODUCTION 


A proposed long distance waveguide system! will transmit a large 
number of wideband PCM channels via a single waveguide by means of 
a frequency multiplexing arrangement. In previous papers?* an idealized 
frequency multiplex system was analyzed. These papers considered a 
system in which each channel signal, consisting of on-off carrier pulses, 
arrives at a detector together with three unwanted signals: (a) time cross- 
talk, due to leading and trailing edges of neighboring pulses; (b) fre- 
quency crosstalk, which is the interference from neighboring channels; 
and (c) thermal noise* generated essentially in the first amplifier of the 
receiver. Since these unwanted signals cause errors in the reading of the 
pulses, the filtering characteristics are closely related to the probability 
of errors. It has been shown that for any given probability of error it is 
possible to design the filters in such a way that a very desirable result is 
achieved—namely, the simultaneous minimization of time spacing be- 
tween successive pulses, of frequency separation between neighboring 
channels, and of signal-to-noise ratio. 


‘ 


* Throughout the paper, when we talk about ‘‘noise’’ we mean thermal noise 
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These conclusions were derived by analyzing a linear system consist- 
ing of a transmitting filter and a receiving filter. Such a scheme does not 
give quantitative answers to three practical questions: 

i. Because RF power is expensive it is desirable to drive the trans- 


mitting amplifiers as hard as possible, and consequently they must 


operate in the nonlinear portion of their characteristic. What is the in- 
fluence of nonlinearity on the system’s performance? 

ii. Filters between transmitting and receiving amplifiers reduce the 
signal without changing the noise. Consequently, in order to increase 
signal-to-noise ratio it would be desirable to have no filter between those 
amplifiers. But then the frequency crosstalk would be prohibitively high. 
How much filtering before and after each amplifier makes a reasonable 
compromise? 

ili. All of the immediately available channel-dropping filters that 
connect transmitters and receivers with the transmission media‘ are 
approximately Butterworth filters of first or second order. Pulses belong- 
ing to one channel will lose part of their power in passing through neigh- 
boring channel-dropping filters. How is the frequency spacing of channels 
affected by this loss? 

The object of this paper is to answer those questions. More specifically, 
we want to find out the influences that compression, distribution of filter- 
ing with respect to the amplifiers, and the use of the available dropping 
filters have on time crosstalk, frequency crosstalk, and signal-to-noise 
ratio. The results extend those already derived previously .?* 

In Section II the system to be analyzed is described and the funda- 
mental formulas are introduced. The influences of neighboring channel- 
dropping filters and nonlinearity of amplifiers are described in Sections 
III and IV. Signal-to-noise ratio is evaluated in Section V, and some de- 
sign examples are given in Section VI. Section VII contains a résumé 
of results and conclusions. 


II. SYSTEM DESCRIPTION AND ANALYSIS 


The system to be analyzed, Fig. 1, is ideally phase-equalized. The ef- 
fect of imperfect equalization can be derived’ as a perturbation of the 
results obtained in this paper. 

Table I shows that the transfer characteristic of each transmitter, 
excluding the amplifier is Gaussian and that that of each receiver is 
given by the sum of two slightly displaced Gaussian functions, which 
approximates the characteristic of a maximally flat Butterworth filter 
of third order and is easy to handle mathematically. Quantitative rea- 
sons for the selection of these filters can be found in a previous paper.’ 
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Fig. | Block diagram of system (see definitions of filters in Table T) 


Qualitative reasons are that these filters make a good compromise for 
low time crosstalk and frequency crosstalk. In effect, in order to mini- 
mize the tails of the output pulses and consequently the time crosstalk, 
it would be best to have both filters Gaussian. On the other hand, in 
order to minimize frequency crosstalk, it would be best to have both 
filters maximally flat (Butterworth) of high order. A reasonable com- 
promise is achieved by making one of them Gaussian and the other 
maximally flat of high order. A transfer characteristic relatively easy to 
achieve without too much midband attenuation is that of a maximally 
flat filter of third order, which we adopt. 

If the transmitter and receiver are centered at the same frequency, 
the transmission through them measures the insertion loss through a 
channel. If they are centered at different frequencies, the transmission 
through them measures the frequency crosstalk. 

Each transmitter includes three filters and a nonlinear amplifier. The 
filter la preceding the amplifier and the combination of the two fol- 
lowing it, 18, are Gaussian. Each receiver consists of a linear amplifier 
between two filters 2a and 2y whose combined transfer characteristic is 
approximately maximally flat of third order. This scheme allows one to 
evaluate the effects of the amount of filtering preceding and following 


each amplifier. 
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The channel-dropping filters ly, 2y, 3y, and 4y connecting the trans- 
mission medium and transmitters to receivers are maximally flat of first 
or second order. No channel-dropping filters have been indicated in Fig. 
1 to the left of the transmitter nor to the right of the receiver, because 
such filters will not influence the transfer characteristic between trans- 
mitter and receiver. 

A Gaussian pulse modulating a carrier is the input to the system. The 
amplitude has been selected in such a way that the amplitude of the 
pulse reaching the nonlinear transmitter’s amplifier is unity. The en- 
velope of the output pulse reaching the detector has been calculated 
under the following assumptions: 

i. Of all the neighboring channels, only those with center frequencies 
immediately adjacent to the carrier influence the output pulse. 

ii. The transfer characteristics of the channel-dropping filters of the 
neighboring channels can be approximated by the first three terms of 
their power series expansions around the carrier frequency. 

The envelope E(t) of the pulse reaching the detector has been calculated 
in (79) of Appendix A: 

x . « a2 
E(t) =|>> E - — = + — : ; | (1) 
v=0 27g, (2Zmrge2)- ot 
Before detailing the exact meaning of each letter we will give a rough 
functional interpretation of this expression: ¢; and g» have to do with the 
presence of neighboring channel-dropping filters; if these filters had no 
influence g; and ¢g. would be infinity and the envelope /(¢) would be re- 
duced to 


E(t) =|Q0 Iw). (2) 
N ==() 


The infinite number of terms is due to the nonlinearity of the transmit- 
ting amplifier, since in general the output is given by a summation of 
powers of the input. Those powers are an edd number 2N + 1, and, if 
the term with N = 0 is the only one different from zero, the amplifier 
is linear; if only the terms with VN = Oand N = 1 are different from zero, 
the amplifier has compression of third order; if only the terms with 
N = 0, N = 1, and N = 2 are different from zero, the amplifier has 
compression of third and fifth order; and so forth. 
Now we define the symbols in (1): 


ee: Ty By npA/2)? By—4p2b(1—by? A? By)—By(t/T) 2 
N “UN 
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(rR )° 
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(10) 


a 
filter, 
bh = 


an approximately maximally flat filter of third order, 


0.346 is a parameter normalizing the bandwidth of a Gaussian 
1.3 and n = 1.61 are parameters that normalize the bandwidth of 


f, is the center frequency of the transmitter and the carrier frequency, 

fs is the center frequency of the receiver, 

f,; and f; are the center frequencies of neighboring channels, 

2F, is the transmitting bandwidth measured at half power assuming 
a linear amplifier, 

2F. is the receiving bandwidth measured at half power assuming a 


linear amplifier, 


QF i- is the 
transmitting 
2F ig is the 
transmitting 
2Fi, is th 
channel-droy 


» half-power bandwidth of the Gaussian filter preceding the 
amplifier, 
» half-power bandwidth of the Gaussian filter following the 
amplifier, 
e half-power bandwidth of the transmitting Butterworth 
ping filter, 
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2F.., is the half-power bandwidth of the receiving Butterworth chan- 
nel-dropping filter, 

q is the order of the Butterworth channel-dropping filter at the trans- 
mitting end, 

r is the order of the Butterworth channel-dropping filter at the re- 
ceiving end, 

2T is the width of the input Gaussian pulse measured 8.686 db down, 


(11) 


is the ratio of channel spacing to receiving bandwidth, 


and 


aT Fy 


A 
(gy 41 are the odd coefficients in the power series 
(14) 


that relates the input and output amplitudes 7; and vo of the nonlinear 
transmitting amplifier. 

Where double signs are indicated, the upper one is to be used if 
fs = 2f, — fx (neighboring channels at opposite side of carrier f;), and 
the lower one if f; = fy (neighboring channels at the same side of car- 
rier fi). 

If the transmitting amplifier is linear (a, = 0 for vy # 1) and there is 
no interference from neighboring channels (f; = f; = %), expression 
(1) measures the transmission through a channel (if fi = fe) or the fre- 
quency crosstalk (if f; # fe), as calculated previously.® 


Ill. THE INFLUENCE OF NEIGHBORING CHANNEL-DROPPING FILTERS ON 
SIGNAL-TO-NOISE RATIO AND CROSSTALK 


Assume: 
(a) Linear transmitting amplifier. Then 
Ay = 0 for N # 0, 


and only the first term of the summation (1) remains. 
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(b) Transmitter and receiver centered at the same frequency, 


fi = fe. (16) 


Call £,(t) the output envelope (1) when conditions (15) and (16) are 
satisfied, and normalize this output to the value /)(0) that measures the 
peak of the output (¢ 0) that would occur if the channels were in- 
finitely spaced, (f; = fs x). Then 


: | rs I 0 } t/T)2B, cog MA Bun! 
Ex(t) _- Qngn)? OF i 


EO) | (l+ S)(1 + R) 


1 oa t/T)2B, uA Bont 2\1 
if cos aa 
2r¢, Ot 1 


(1+ S)(1 + R) 








2B 
l ——. [1 — 4 (pAn)?By 
E\(0) (2rgeT )? | pane | (18) 


E,(0) V(14+ S8)(1+ R) 





The insertion loss in db due to the proximity of neighboring channel 
dropping filters is measured by 


h£(0) 


20 log E,(0)’ 
44 


This function has been plotted for maximally flat channel dropping 
filters of first order (q = r 1) in Fig. 2 and for maximally flat filters 


of second order (q = r = 2) in Fig. 3. The abscissas 


are proportional to channel spacing. The rows correspond to different 
ratios F.,/F, between the bandwidth of the channel-dropping receiving 
filter and the bandwidth of the receiver. The different columns corre- 
spond to different ratios F,,/F, between the bandwidth of the channel 
drepping transmitting filter and bandwidth of the transmitter. The solid 
lines correspond to the ratio between the transmitting and the receiving 
bandwidths F,/F. = 1.3, the dashed lines to the ratio F';/ F» 1; the 
upper solid and dashed lines in each set of curves correspond to TF) = 
0.5 and the lower solid and dashed lines correspond to #7'F; 1. We 
have chosen not to include as another variable the selection of signs in 
the expression for g: , (8), because its effect is small. The two possible 
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FIRST-ORDER 


FILTERS (q=r=1)- 
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|f, — fs | - | f,-f,| 
3F2 2F2 
E, (0) 


ORDINATES : 20 LOG 
E,(0) 





ABSCISSAS : p= 


Fig. 2 Insertion loss due to first-order neighboring channels. 
choices of signs correspond to frequencies f; and f; of the neighboring 
channels on the same or different sides of f, . The first case introduces 
less insertion loss, but for the ranges of values selected in Figs. 2 and 3, 
and for small abscissas, the difference in ordinates for the two cases is 
smaller than 0.5 db. As a compromise, Figs. 2 and 3 have been calculated 
by averaging the results. 
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SECOND-ORDER 
FILTERS (Q =r =2) 
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|f,- f;| ms f,| 
2F2 2F, 
E,(0) 


ORDINATES : 20 LOG 
E,(0) 





ABSCISSAS : 9 = 


Fig. 3 Loss due to second-order neighboring channels. 


Let us concentrate on Fig. 2, (q r 1; that is, channel-dropping 
filters of first order). What parameters yield a system with low insertion 
loss due to neighboring channels? For a given abscissa (normalized chan- 
nel spacing p), the smallest ordinate occurs in the lowest line in the upper 
left set of curves. This line is characterized by 


Fi, = Fo = F, Fy ; rT F, 
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As could be expected, the system should have 
(a) long input pulses, 
(b) wide and equal bandwidths in transmitters and receivers, 
(ce) narrow and equal channel-dropping filters at transmitting 
receiving ends. 
In order to compare Figs. 2 and 3 we compute an example: 
For 
Fi, 
7 


F, F ais (f —t) 
“= 15, «TF; =1, and p = ~~ = 


° 2F. 


the insertion loss due to neighboring channels is 1.25 db if g = r = 1 in 
Fig. 2 (channel-dropping filter of first order) and 0.25 db if g = r = 2in 
Fig. 3 (channel-dropping filter of second order). Second-order filters 
make the insertion loss due to neighboring channels 1 db better than 
that of first-order filters. But this does not necessarily mean that the 
total insertion loss will be 1 db better in a practical case. There are two 
reasons for this statement: 

i. A second-order maximally flat filter, has more cavities and conse- 
quently more heat loss than does one of first order. We don’t know how 
much the difference is, but assuming it is 0.5 db, the increase of heat loss 
in the two channel-dropping filters is 1 db, and consequently the ad- 
vantage cited previously is only apparent. This point becomes more im- 
portant for smaller tolerable insertion loss due to neighboring channels 
(large p). 

ii. Consider the transmitter’s total Gaussian transfer characteristic 


r F,)2 
weer! u (19) 


and the transfer characteristic of its gth order channel-dropping filter 


l 


Y~—y = + f se hi 2° (20) 
4} Fi, 


The transfer characteristic of the rest of the filter should be y;/y;, . There 
are values of the frequency f for which this function is larger than unity, 





and therefore it is impossible to realize a passive filter with the desired 
transfer characteristic. The difficulty is solved by allowing the transfer 
characteristic of the transmitting filter to be not (19) but 

f f,)/F 4)? 


{i = ; (21) 
K 


where A is a constant greater than unity and equal to the maximum 
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value that y:/i1, can achieve; K expressed in db is an added insertion 
loss. 

The previous reasoning applies also to the receiver, but, since we are fi- 
nally going to be interested in signal-to-noise ratios, the value of K ap- 
plied to the receiver drops out. 

In order to have an idea of the values that K may achieve in Gaussian 


transmitters we have calculated some examples: 


20 log K 


0.3 db 
First-order maximally flat channel-dropping 
filter (q = r= 1): 


< 0.1 db 


Second-order maximally flat channel-drop- 


ping filter (q = r = 2): 
9 


In general, the broader the channel dropping filter and the flatter the 
transfer characteristic of the transmitter or receiver, the smaller the 
added insertion loss 20 log K. This added insertion loss can be neglected 
except when Fi, = F,andg =r = 1. 

Now we turn to the increase of time and frequency crosstalk that is 
introduced by the presence of neighboring channel dropping filters. It is 
shown in Appendix B that, for systems in which the increase of insertion 
loss due to neighboring channel-dropping filters is not very large, (around 
1 db), the increase of both time and frequency crosstalk is negligible. 


IV. INFLUENCE OF THE NONLINEARITY OF THE TRANSMITTING AMPLIFIER 
ON SIGNAL-TO-NOISE RATIO AND CROSSTALK 


Let us assume that, in Fig. 1, (a) the transmitting amplifier is driven so 
hard that the input signal is not linearly amplified, and (b) the presence 
of neighboring channel dropping filters can be neglected; this means that 
fs = fs = @ and, from (7) and (8), 


A 


1 g2 = @, (22) 


Call F2(t) the output envelope (1) when (22) is satisfied and normalize 
this output to the value /)(0) that measures the peak of the output 
(t = 0) that would occur if there were no compression (Ay = 0 for VN # 
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0), and with the transmitter and receiver centered at the same frequency, 


fi = fe. Then, 


’ x 
E(t) A N Ty By (npA/2)? (By—Bog)- 4p2b(1—bu? A2By)—By(t/T)? 


E,(0) a N-=-0 yn T V By ; 


(23) 


- cosh | 200 1 — by A°By) + inpABy | 


As explained in Appendix A, this expression has a simple physical 
interpretation. The Gaussian pulse of unit amplitude and width 


om a 
21 1+ GTR 

entering the nonlinear amplifier Fig. 1 produces an output that is a sum- 

mation of Gaussian pulses each characterized by the integer NV. The 

larger N is, the smaller are the amplitude and width of the corre- 

sponding pulse. Because of the linearity of the circuit following the 

amplifier, the envelope of the normalized output of the system (23) is 

the envelope of the sum of the transients produced by the Gaussian 
pulses. 
For 


(24) 


(23) measures the normalized maximum output intensity through a 
channel (p = 0) which, expressed in db, we call output compression : 


BE ;(0) : Ay Ty By nu A/2)2(By—B ™ 
20 l y — = 20 | Fi0 ] / 4 ‘ = ‘ (2! 
°8 (0) _ | * 2 AT VB ™ 


Naturally, if there were no compression (linear amplifier), Ay would be 
zero for all N and the output compression would be zero db. 

The amplifier compression characterizes the nonlinear amplifier. It 
measures in db the ratio between the amplitude of a sine wave input and 
the amplitude of the output without harmonics. It is derived from (25), 
(5), (6), and (13), making T = «: 


0log “4 — 20 log (1 Me as) (26) 
Ey(0) 1 Ao 

Assuming the summations in (25) and (26) to be small compared to 

unity, it is possible to expand the logarithms in series and to retain the 

first term; then the ratio between output compression and amplifier 

compression is 
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E,(0) + 1)By 
E,(0) = Aw 


log E.(0) >» 


1 Ao 


E3(0) “Ay / - fr \(npA/2)2 (By—Bo) 
log ] = »» 4 { 0 V (2N “DB ; 





Furthermore, if the amplifier has only compression of order 2N 
the previous expression becomes 
E3(0) 
E,(0) ; i N _ (npA/2)?(By—Bo 
Ex(0) VY ? 
6 E,(0) 


log 


This function has been plotted in Fig. 4. In abscissas we carry both 
(F,/Fs)?, the square of the ratio between the total transmitting band- 
width and the bandwidth of the filter after the amplifier, and F\./Fis , 
the ratio between bandwidth of the filters preceding and following the 
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Fig. 4 — Compression of output pulse due to nonlinearity of amplifier. 
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amplifier in the transmitting end. The two functions are related by the 


following expression: 
’ ’ 2 4 
Pia = | (f=) os | 
Fig Fy 


The upper and lower sets of curves correspond to different ratios be- 
tween transmitting and receiving bandwidths un = F,/F,. The solid 
lines correspond to amplifier compression of third order (2N + 1 = 3) 
and the dashed lines to amplifier compression of fifth order (2N + 1 = 5). 
The parameter 77'F, is proportional to input pulse width 27’, and trans- 
mitting bandwidth 2F; . 

Let us use these curves for an example. Suppose that the amplifier 
compression is —2 db. This means that the amplifier output is 2 db be- 
low what it would be if the amplifier were linear. Let us assume further 
that the amplifier’s compression is of third order (NV = 1) and that 

Py Fria 


= =1; —=1; Pry = 1. 
zi PF; ' Fig eigit 


For these parameters the upper curve in Fig. 4 yields 


output compression ica 
ra : 0.763 ’ 

amplifier compression 
and, since the amplifier compression is —2 db, the system output com- 
pression is —1.53 db. This means that the amplifier compresses more 
than the system of which it is a part. 

The output compression can be reduced by decreasing #7'F; and by 
incre: sing F\/F2, Fya/Fis , and N. In other words, for given input pulse 
width 7’, the output compression is reduced by placing as much filtering 
as possible after the amplifier and by using an amplifier that works like 
a limiter (large NV). These conclusions were to be anticipated if we recall 
that the compression of order 2N + 1 introduces a spurious Gaussian 
pulse in the output of the amplifier. Its effect on the output can be re- 
duced not only by filtering as much as possible but also by making that 


spurious pulse very narrow (large NV). Using the example elaborated 


previously and making N = 2, the output compression is reduced from 
—1.53 db to —1.4 db. 
Because of the compression of the top of the output pulse, time and 
frequency crosstalks are increased by 
E;(0) 


20 log — lb, 
"8 R0) 
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and the signal-to-noise ratio is reduced by the same amount with respect 
to the values that would be obtained if the amplifier were linear. 

The crosstalks are different from the linear case, not only because of 
the compression of the top of the output pulse but also because the tails 
of the pulses are changed, thus modifying the time crosstalk (extra time 
crosstalk), and because the narrow spurious pulses have a wide spectrum 
that change the frequency crosstalk (extra frequency crosstalk). It is 
shown in Appendix C that in computing time crosstalk it is slightly con- 
servative not to take into account the extra time crosstalk, and in com- 
puting frequency crosstalk it is conservative to ignore both the output 
compression and the extra frequency crosstalk. 


V. SIGNAL-TO-NOISE RATIO 


We want to calculate the maximum signal-to-noise ratio in the system 
shown in Fig. 1. Assume (a) the transmitter and receiver tuned at the 
same frequency f; = f2, (b) the nonlinearity of the transmitting ampli- 
fier of order 2N + 1, and (c) small insertion losses (about 2 db) due to 
compression and proximity of neighboring channel-dropping filters. 

Then, except for a constant measuring the insertion loss of the trans- 
mission media, the signal-to-noise ratio in db is derived from (1) and 
(100) of Appendix D: 


o 


1(0) 1(Q) hy 
20 log : = 20 log E | + 20 log > 


oO 


Lal 
40 


in which the first term, 


20 log E 24 me 
o 0 


a nuA/2)2B 
9\i 1+ ne Boe *" 
20 log (*) \ ae 


us , oe Ga ae Be 
WoFo} 1 + : - ( : 3 —— ] 
(—2)" \Fo, abr Vb 


measures in db the reference signal-to-noise ratio in the system —- that 





is, the one that would be obtained if there were no compression and no 
neighboring channels. The other terms are already-known corrections. 
The second term measures the insertion loss due to neighboring channels 
(18) and is plotted in Figs. 2 and 3; the third term measures the in- 
sertion loss due to compression (25), and can be derived from Fig. 4. 
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Fig. 5 — Relation between tolerable crosstalk filtering and pulses. 


Before plotting the first term we are going to establish a functional 
relationship among several parameters. This relationship answers the 
following question: Given the tolerable time crosstalk at the detector of 
a system in which the input is a train of Gaussian pulses 27’ wide and 
7 seconds apart and the sampling time is 27,, what are the possible 
combinations of bandwidths 2F; and 2F,, of the Gaussian transmitter 
and maximally flat receiver respectively, that satisfy those demands? 
The answers have been given in Figs. 4 and 7 of Ref. 3. We reproduce 
them in a convenient way in Figs. 5(a) and 5(b). They correspond to 25 


and 30 db of time crosstalk per tail. The parameter for the solid curves 


1S 


oT = = 
5 


(short pulses) 
and for the dashed curves it is 
Re colt f 

2T = 195 (long pulses). 
Straight lines of constant ratios u» = F\/F. have been added for con- 
venience. The curves have been limited to the region between the straight 
lines « = 1 and uw = 1.5 because, as will be seen later, that is the region 
where the signal-to-noise ratios pass through desirable maxima. 
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Fig 6 t{eference signal to-noise ratio in db 


The values of F; , fF: , and T from the curves in Fig. 5 have been sub- 
stituted in (30), and this reference signal-to-noise ratio expressed in db 
has been plotted with arbitrary reference in Fig. 6. The first and second 
rows of curves correspond to receiving channel-dropping filters of order 
r = land r = 2 respectively. The first and second columns of curves 
correspond to 25 and 30 db of time crosstalk per tail. The solid lines 
correspond to shorter input pulses than do the dashed ones, and the 
parameters on each curve are different ratios between the bandwidths 


2F\g and 2F>2, of the filters between amplifiers and transmission mediums 


and the bandwidths 2F; and 2F, of transmitter and receiver. 
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Let us examine how sensitive the signal-to-noise ratio is to each 
parameter. 

Any curve in the first row (r = 1) and the corresponding one in the 
second (r = 2) differ only by a few tenths of a db, and therefore the sig- 
nal-to-noise ratio is fairly insensitive to the order r of the receiving 
channel dropping filter. 

Corresponding curves in the first and second columns (different time 
crosstalk) have similar ordinates. This means that crosstalk can be re- 
duced at the expense of bandwidth without changing signal-to-noise level. 
Signal-to-noise level is improved by changing from short input pulses 
(solid lines) to long input pulses (dashed lines), but the advantage is 
paid for in channel bandwidth. 

Finally, we find that signal-to-noise ratio is very sensitive to the 
parameter 

Fig F2, 
ee 
which is essentially proportional to the bandwidth of the filter connecting 
transmitting and receiving amplifiers. The signal-to-noise ratio grows 
by widening this bandwidth, but it cannot be widened indefinitely be- 
cause it increases the insertion loss due to neighboring channel-dropping 


filters, Figs. 2 and 3. Nevertheless, curves corresponding to 
Fis F 
F, F, 
have been included in Fig. 6 to show the maximum achievable signal-to- 
noise ratio. 


9. 
’ 


VI. DESIGN EXAMPLES 


Given 
(a) 1/r = 160 me, pulse repetition rate; 
(b) 7 = T,, equal sampling time and input pulse width; 
(ce) 25 db time crosstalk per tail; 
(d) 2 db, third-order amplifier compression ; 
(e) q = r = 1, channel-dropping filters of first order; 
calculate 27, 2F, , 2F 2, 2F ia, 2F ig , 2F 1, , 2F2,,| fi — fs |. In order to 


have low insertion loss due to neighboring channels and large signal-to- 


noise ratio we select 
FP, 
F. 


Fi, Fig F., . 


Mm 
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From (a), (b), (31), and Fig. 5(a), we obtain for short input pulses 
aTF, = 0.54, (33) 
2T = 3.125 millimicroseconds, (34) 
2F, = 2F, = 220 me. (35) 
From (c), (e), (33), and Fig. 6, the reference signal-to-noise ratio re- 
sults: 


Fig _ Fry _ Fm 


| 4.4 db for 


‘ F FP F, 
20 log EJ ‘d 
oO 0 " 


3.2 db for Fi, = Lo 


F, 


From (d), (31), (33), and Fig. 4, we calculate the insertion loss due to 
compression and consequent increase in time crosstalk per tail: 


~ ; i 
dea x 2=14 db tor 4 = 2 
E(0) _ Fy 


20 log - 
06 E;(0) 


Fy 


F, 


10.68 K 2 1.36 db for 


From (e), (31), (32), and (33), the insertion loss due to neighboring 
channels is given approximately in the upper dotted curves in the center 
and lower right set of curves in Fig. 2. The abscissa which determines 
the frequency occupancy fi — f; , should be selected as small as possible. 
We will elaborate two examples: 


Example 1. 
We select 


The insertion loss due to neighboring channels is 


~4db for 
20 | E00) 
20 log F(0) F, 
1.7 db for 


ra 


The total signal-to-noise ratio (29) derived from (36), (37), and (39) is 
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| 44—14—4= —I1db for Py = 
E(O) _ ry 
oO Fi, 


ra 


20 log 
3.2 — 136 — 17 = 0.14 db for 
Because of the better signal-to-noise ratio we adopt 


F,, 
F, 


and deduce from (32), (34), (35), (38), and (41): 


= 15 


2T = 3.125 millimicroseconds, 
= 220 me, 


= 2F,, = 330 me, 


“7 





= = 295 me, 


= 440 me. 


Example 2. 
We select 


fi = fs 
2F 


The insertion loss due to neighboring channels is (Fig. 2), 


23 db for Py 

20 log £(0) is 
20 lo = 

5 Fy(0) oa ‘ Fi, 

abd OF -=— 

Fy 


Then from (29), (36), (37), and (44), the total signal-to-noise ratio is 


4.4 - 1.4 — 1.3 - 1.7 db for Fi, 

(0) PF, 

20 log = 
o 


3.2 — 1.36 — 0.7 = 1.14 db for bs = 


Selecting the best signal-to-noise ratio, we adopt 
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F,, 
PF, 
and deduce from (32), (34), (35), (43), and (46): 
2T = 3.125 millimicroseconds, 
2F, = 2F. = 220 me, 
2F\, = 2Fig = 2F 2, = 440 me, 


2F; 





= 254 me, 


fi —fs 660 me. 


In the first example, the signal-to-noise ratio (40) and the channel 
spacing (42) are 0.14 db and 440 me respectively. In the second example 


these values, (45) and (47), are 1.7 db and 660 mc. The first example 
provides better frequency occupancy and the second provides greater 


spacing between successive repeaters. 


VII. RESUME OF RESULTS AND CONCLUSIONS 


The influence of (a) neighboring channel-dropping filters, (b) non- 
linearity of transmitting amplifiers, and (c) filtering preceding and fol- 
lowing the transmitting and receiving amplifiers on the signal-to-noise 
ratio, time crosstalk and frequency crosstalk in a frequency-division 
multiplex PCM system has been calculated. 

Neighboring channel-dropping filters reduce the reference* signal-to- 
noise ratio by increasing the signal insertion loss, Figs. 2 and 3. If these 
filters are of first order, the channel spacing is determined by the tolera- 
ble insertion loss they introduce and not by the frequency crosstalk, as 
calculated in Ref. 3. If these filters are of order higher than one, the chan- 
nel spacing may be determined by the tolerable insertion loss or by the 
tolerable frequency crosstalk, (Figs. 5 and 8 of Ref. 3), depending on 
which demands more channel spacing. Neighboring channel-dropping 
filters have no practical bearing on time crosstalk. 

Nonlinearity of the transmitting amplifiers decreases the reference 
signal-to-thermal-noise ratio and increases time crosstalk by the same 
amount, Fig. 4, but it is conservative to ignore its effect on frequency 
crosstalk. 

* Signal-to-noise ratio in the absence of neighboring channels and with linear 
amplifiers. 
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The tolerance to compression is increased by concentrating as much 
filtering as possible between the nonlinear transmitting amplifier and 
the detector. 

Even for compression as large as 2 db the increase in frequency and 
time crosstalk is smaller than the amplifier compression. This means that 
the system can be readily designed to take advantage of the greater 
power available from an amplifier operating in the nonlinear region. 

Finally, reducing the filtering between transmitting and receiving 
amplifiers increases the reference signal-to-noise ratio, Fig. 6, but it also 
increases the signal insertion loss due to the presence of neighboring 
channel-dropping filters, Figs. 2 and 3. 


APPENDIX A 


Transmission Through Nonlinear Amplifier and Filters 


Given a system consisting of five filters and a nonlinear amplifier, Fig. 
1, we want to calculate the envelope of the response to a certain input. 
The transfer characteristics of the cascaded filters are: 


Tie = ¢ us 1S ) 


Yis = 19 ) 


50) 





ol) 





where 


n 1.61, 


and fi, fo, fs, fs are the center frequencies of the different filters; 
2Fia, 2F is, 2F2, 2F1,, 2F 2, are their half-power bandwidths; and jr. 
and 3 are Gaussian transfer characteristics and #2 is the transfer char- 


acteristic of an approximately third-order maximally flat filter.’ 
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The characteristic y;, is the transfer function between two ports of a 
three-port channel-dropping filter*®* proposed for use in a long dis- 
tance waveguide communication system.' This filter 3y (see Fig. 1) has 
port 1 matched at all frequencies, and the transfer function between 
ports | and 3 is that of a phase-equalized maximally flat filter of order 
q, bandwidth 2F;, , and center frequency f; . Because of conservation of 
energy it follows that the phase equalized transfer function between 
ports | and 2 is (51). ys, is similar to ys, . 

Between the first two filters la and 18 in Fig. 1 there is a nonlinear 
amplifier. Its infinitely wide band output voltage vo(f) as a function of 
the input v,(t) is 


vo(t) = z- a,[v,(t)]’. 


The input to the system, 


a 


T)? . io 
pee cos 2rfil, (54) 
(xTF.,)? 


(t) = 4 1+ 


is a Gaussian pulse 27 seconds wide, at 8.686 db down, that modulates a 
carrier J; . The amplitude 


a 
1+ GTrD 
has been selected to make the maximum amplitude of the signal v,(t) 
reaching the amplifier unity. 

In order to evaluate the system’s output we start calculating the am- 
plifier’s output vo(t). This is achieved in (53) by expressing v;(¢) in terms 
of the input 7(¢) to the system 


Volt) = 2. a, | | gq ( f) that y2aft af] 


in which 


: T 
gf) = Vrs 4 es : 


(rT Fa)? if 
is the Fourier transform of 7(t) of (54). 
Substitut.ng (48) and (56) in (55), performing the integration, and 
assuming 
TF ,)? 


< 1, 


one derives the explicit value of the amplifier’s output : 
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a  ¥ 
y v t\ 
Volt) = > Ae kktiocni:—_—_——_————_—— r) 
4 one © 
(rTF,)? 
Since 


v . 
cos 2rfil 


where 


(58) becomes 


Vo | t ) = 


ma v ie 

a, fv v YY 
z: p f ¢ exp| — — cos (vy — 2r)2rfit. 
yah yes 2 AS a 1 


Y (4TF.)* 
A further simplification: assume that only frequencies in the neighbor- 
hood of f; are allowed to pass through the filters following the amplifier. 
Physically, this means that all harmonics of f; are filtered out; mathe- 
matically, this is translated by keeping in (61) only the terms in which 
y— 2r= +1. (62) 


Consequently, 


vo(t) = cos 2nfit Ave 78”, 


where 


T OTF.)? 
IN EI 


The output from the nonlinear amplifier v(¢) is, then, a summation of 


(65) 


Ty=T4 


Gaussian pulses each of amplitude Ay and width 27'y measured 8.686 
db down. 

Using vo(t) as input to the second filter 18 in Fig. 1, we calculate the 
Fourier transform of each term with the help of (56) and deduce the 
system’s output: 
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/ al x0 
Vr m (eT Ny (\f|—S1)1? 4 
e(t) = “Z 3 AyT y YipyeysyYsyé — 
- 0 Canep 


We perform 


a ay (08) 
a change of variable, 


f=eth, 


and assume 


Tn 1) 


ae 


Then the envelope of the output (66) becomes 


= 


K(t)=v~ . +S AyTy re ere BYU ayY 47 dg | , (69) 


where yis, Y2, Ys, 


, Yay are deduced from (49), (50), (51), (5 
(67) to be 


52). and 


(70) 


¢ se 
cosh n (71) 


Y3- 





| 1+ ( Fi, yr 
4 yg + fi = fs 


l 


F 4, - 
l - — : 
| Deer eet 


In order to solve the integral in (69), we simplify part of the integrand 





by noticing that, on account of the exponential functions, most of the 
contribution to the result comes from small values of g, and consequently 
Yay » Ys. Can be expanded in power series. We keep the first three terms 


, ¢g\ 
¥1 2 


Vii + S)(1 + R)’ 





where 





COMPRESSION, FILTERING, SIGNAL-TO-NOISE RATIO 
i = % 


act gs rR 
is2 ° Th’ 





fi—fs 
q(2qg + 1)8 r(2r > L)R meh qrSR 
2(1 + S) 2(1 + R) (1+ S)(1+ R) 


_ 3 (*, 9 = a(; =e 
2\1+S 2\1+ R 


Where double signs are indicated, the upper one is to be used if f; = 
. 2f1 — fs (neighboring channels at opposite sides of carrier f,), and the 
lower one if f; = f, (neighboring channels at the same side of carrier f;). 

Substituting (70), (71), and (74) in (69) and integrating, one obtains 
the output envelope 


E(t) = Zz: 
| N 





: ol | Fi, 1 js 

Ty — ana re fn ’ (79) 
21g, Ot (21rge)* dt? 

in which 


2/2) By—4p2b(1—by? A2 By) —(t/T)2B 
N N N 


a+S)1+R): 


‘cosh | 200 1 — bu’ A*By) + inpA By * | ’ 


l 





») T 


APPENDIX B 


Influence of Channel-Dropping Filters on Crosstalk 


We want to compare the frequency and time crosstalks of the system 
in Fig. 1, assuming a linear amplifier with the already known’ frequency 
and time crosstalk of the same system obtained assuming not only 
linear amplifier but also no neighboring channel-dropping filters. 

The maximum frequency crosstalk between neighboring channels is 
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deduced from (69), making 

Ay = Ofor N = 0 (linear amplifier), 

fi # fe (transmitter and receiver centered at neighboring frequen- 
cies), 

fa, fa # fi (fs and fy are the center frequencies of neighboring channel- 

dropping filters), 

¢t = 0 (in order to maximize the integrand). 
Then 


x0 
E\(0) = VwrAolo / "as “YraYYayYsy de. (84) 
a x 

We eliminate the effect of neighboring channel-dropping filters by 
assuming f; and f; to be infinitely large frequencies. Then 3, and ya, 
given in (72) and (73) become unity and 

noe 
E\(0) = VY rAol’o | ” al yay? dy. (85) 
° a 
This result has been calculated elsewhere*® and consequently is known. 
The reader can check that £,(0) is slightly smaller than /,(0), because 
the product y3,ys, in (84) is slightly smaller than one for values of ¢ for 
which the rest of the always-positive integrand contributes substantially 
to the result. Therefore, it is slightly conservative to ignore the effect 
of the neighboring channel-dropping filters on frequency crosstalk. 

In order to evaluate the influence of neighboring channel-dropping 
filters on time crosstalk, we must compare the maximum values of two 
transients during the sampling time. The first transient (17) is that of a 
pulse through a system with neighboring channel-dropping filters; the 
second transient is that of the same pulse through the same system ex- 
cept for the removal of the neighboring channel-dropping filters. The 
last transient is obtained from (17) making f; = fy; = ~, 


Eo(t) ; 2p uABnt 


- = ¢ COs - (S86) 
E£y(0) 1 


and the time crosstalk that is derived from this expression is known.* 

If the pulses must be closely spaced, the maximum time crosstalk 
must be fixed by the sloping part of the transients, and consequently 
the broadening of the pulse due to the presence of the neighboring 
channel-dropping filters indirectly measures the time crosstalk. For 
small values of ¢, (17) and (24) yield, after power series expansion, 


_ 2B(1 + nes?) |e a 


( 21) T)? 


E(t) a 


E,(0) — net} 
Lo } 


1—(1+ sn’ A°B)| I 
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‘“ > 


E,(t) 2 wus t 
~1 — (1 + 4n'pA’B)B —_. (88 
E.(0) ~ ll sie T? : 


Since expressions (87) and (88) have the same functional dependence, 
the transient of a system with neighboring channel-dropping filters is 
equivalent to that of a system without neighboring channel-dropping 
filters that operates with a. wider input pulse 


(89) 


27 = or + B! 


cs seed 
(29g, T )? ; 
It is possible to prevent any widening, 7; = 7, by making g = ~. 
This is achieved in (6) by selecting the center frequencies fs and f; of 
the neighboring channels at opposite sides of the center frequency f; of 
the channel in which the time crosstalk is being studied. But even if 
¢1 is finite, the relative pulse width increase (7/7) — 1 is negligibly 
small for typical cases. For example, if one selects the parameters of a 
system with 1 db insertion loss due to neighboring channel-dropping 
filters (center set of curves, Fig. 2): 


Fria f Fsq 
F, I 2 


it follows from (89) that 
0.025. 
APPENDIX C 


Influence of Nonlinearity of the Transmitting Amplifier on Frequency and 
Time Crosstalk 
The nonlinearity of the transmitting amplifier changes crosstalk by 
compressing the top of the output pulse and because the spurious pulses 
generated by the amplifier change the tail of the main output pulse 
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(extra time crosstalk) and part of their spectra is fed into neighboring 
channels (extra frequency crosstalk). 

The time crosstalk at the instant ¢ = ¢T7 is by definition the ratio be- 
tween the envelope of the channel output at that instant and the 
maximum output expressed in db. From (23), calling F(t) the value 
acquired by F(t) if p = 0 (fi = fe), it is 

E3(¢T ) 


20 log ROY > 20 log e®*** | cos nwA Bot 


r x my ve 
— 20 log t + Zz; st *. 4 * ee | (90) 
1 10 0 0 


+ 20 log} 1 + > < r z el nna i221] (By—Bo) 


1 


cos nA Byte | 


cos nuA Bot 


The first term is the main contribution to time crosstalk and has been 
computed elsewhere.’ The second term is the contribution due to output 
compression, 20 log [£3(0)/#o(0)|, (25), and the third term is what we 
called extra time crosstalk in the text. 

Assume: 

(a) only one term of nonlinearity (only compression of third, fifth, 
ete. order), 

(b) small output compression (2 db at most), 

(c) such a pessimistic combination of parameters that 


cos nuA Boe = cos mABye = 1. 


Then (90) becomes 
+2 E;(0) 
= es n—Se") 90 lew = . 82) 


E;(¢T ) 
; — 20 log é 


20 | 
°8 E;(0) E,(0) 


For the range of values of the different parameters given in Fig. 4 
and ¢ > 2, 
(93) 


and consequently it is a slightly conservative assumption to neglect 
the extra time crosstalk due to compression. 

Now we turn to frequency crosstalk. The maximum frequency cross- 
talk is by definition the ratio between £;(0), the maximum of the en- 
velope output in a neighboring channel, and £2(0), the maximum output 
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in the channel. £.(0) is calculated from (23), making f = 0, and E;(0) 
is also derived from (23), making ¢ = p = 0. The ratio expressed in db is 


Es\ 0) 


~~ = 20 log 6 78 4"F0) cosh 2np(1 — byu?A*Bo) 
E;(0) 5 0 


20 log 


x An Py By elnna 2)2 (By * 
N= Ao To 4 By 


1 


— 20 log [ ++ 


< Ay Tw By (uA /2)2 [n2 +(4pb)2] (By—Bo) 
20 ] y 1 ete a ae al i pF N 0 
i ox | t 2 Ao T V Bo 
cosh 2np(1 — by’ A*By) 
cosh 2np(1 — by?A?By) }° 


The first term is the main contribution to frequency crosstalk and it 
has been computed elsewhere.* The second term is the contribution due 
to output compression 20 log [£3(0)/F(0)|, (25), and the third term 
is the extra frequency crosstalk. 

Assuming that the amplifier has only one term of nonlinearity and 
that the output compression is small, the second term of (94) is simpli- 
fied to 


W ) ; Ph ee . ‘e ili 
20 log i sO » 2 5. ly 1 N By (npA/2)? (By—Bo 


E,(0) ~~ Ap Tr ‘V Bo 
and consequently (94) can be rewritten 


E.(0) 25 (1—bu2 A2B £42 
20 log Ex(0) = 20 log ¢ torb—bu" 4" B0) cosh 2np(1 — by? A*By) 
43 
E3(0) 


— 20 log B(0) 
40 


BAO). . censadtinenesy + 
+ 20 on + 2.3 log 05 (2H pb)? (By—Bo (96) 
40) 


cosh 2np(1 — bu A’ By ) 
cosh 2np(1 — by?A?By) } 


If the channels must be crowded, p is small, but in that case the extra 
frequency crosstalk can be expanded in series and the second and third 
terms tend to cancel each other. In the limit, for p = 0, 20 log [F2(0) 
E;(0)| = 0, as it should be, since by definition /£2(0) and E3(0) coalesce 
to the same value. 

On the other hand, for large values of p the extra frequency crosstalk 
(third term) becomes more important than the frequency crosstalk due 
to compression (second term), but then the total frequency crosstalk 


, 


20 log [F£2(0)/#;(0)| is negligibly small. 
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Consider, for instance, the following system, in which the parameters 
rT F, = 0.5, Fi/F. = 1.4, Fis = Fo = ©,N = 2, and 2 db of amplifier 
compression have been chosen to exaggerate the influence of nonlinearity 
on frequency crosstalk. Substituting these values in (96), 


E(0) 


20 | : 
og E;( 0) 


—29.1 + 1.04 — 2.9 = —30.96 (in db) for p 

—67.2 + 1.04 — 37.1 = —103.26 (in db) for p 
The first term corresponds to frequency crosstalk for a linear amplifier, 
the second term is the frequency crosstalk correction due to output 
compression, and the third term is the extra frequency crosstalk. For 
small p (p = 2) it is conservative to ignore the second and third terms. 
For large p (p = 3) the total frequency crosstalk is negligibly small. 
Therefore, for all values of p and for all practical purposes the total 
frequency crosstalk can be calculated as if the amplifier were linear. 


APPENDIX D 


Noise Level at the Detector. 


The distribution of the filtering before and after the first amplifier 
of the receiver, Fig. 1, is important because it determines the thermal 
noise power reaching the detector. 

If the noise from the amplifier is white and the power density is Wo 
watts per cycle, then the mean noise power received by the detector is 


o Wo [ Ya : df, (98 ) 


where 


Ye ((if|—fe)/ Fe]? ‘ Je — Ie 
Ya = — = * eosh n ™ oH (99) 


Yry 
is the transfer characteristic of the filter connecting the amplifier and the 
detector. This characteristic has been found as the ratio between ye, the 
transfer characteristic of the receiver (approximately third-order maxi- 
mally flat), and ye,, the transfer characteristic of the receiving channel- 
dropping filter (maximally flat of order r). 
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Performing the integration (98), the average noise power received by 


the detector is 
I= wy AY eli +1... 
/-~ Wo) 1 - -) — F.. (100) 
i | . [ay (F) =| ( Vb ) : 
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